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Explanation-based  learning  (EBL)  is  a  learning  method  which  uses  existing  knowledge  of  the 
domain  to  construct  an  explanation  for  why  a  specific  example  is  a  member  of  a  concept  or  why  a 
specific  combination  of  actions  achieves  a  goal.  This  explanation  is  then  generalized  in  an  analytical 
manner  in  order  to  produce  a  general  concept  description  or  plan  schema.  Although  a  number  of 
exploratory  EBL  systems  w'hich  operate  in  particular  domains  have  previously  been  constructed, 
recent  research  in  this  area  has  lead  to  the  development  of  general  mechanisms  w'hich  can  perform 
explanation-based  learning  in  a  wide  variety  of  domains. 

This  thesis  describes  a  general  EBL  mechanism.  EGGS,  which  can  make  use  of  declarative 
knowledge  stored  in  the  form  of  horn  clauses,  rewrite  rules,  or  STRIPS  operators.  Numerous 
examples  are  presented  illustrating  its  application  to  a  wide  variety  of  domains,  including  "blocks 
world"  planning,  logic  circuit  design,  artifact  recognition,  and  various  forms  of  mathematical 
problem  solving.  The  system  is  shown  to  improve  its  performance  in  each  of  these  domains. 


EGGS  has  been  most  thoroughiy  tested  as  a  component  of  a  narrative  understanding  system. 
GENESIS,  which  improves  its  own  performance  through  learning.  GENESIS  processes  short  English 
narratives  and  constructs  explanations  for  characters'  intentional  behavior.  When  the  system 
detects  that  a  character  has  achieved  an  important  goal  by  combining  actions  in  an  unfamiliar  way, 
EGGS  is  used  to  generalize  the  specific  explanation  for  how  the  goal  was  achieved  into  a  general 
clan  schema.  The  resulting  schema  is  then  retained  by  the  system  and  indexed  into  its  existing 
know  ledge-base.  This  schema  can  then  be  used  to  process  narratives  which  were  previously  beyond 
•.he  system's  capabilities.  The  thesis  also  discusses  GENESIS'  ability  to  learn  meanings  for  words 
related  to  its  learned  schemata  and  reviews  several  recent  psychological  experiments  which 
demonstrate  that  GENESIS  can  be  productively  interpreted  as  a  cognitive  model  of  certain  types  of 
human  learning. 
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PREFACE 


Artificial  Intelligence  is  a  young  science  whose  methodology  is  still  developing.  Their 
currently  exist  a  number  cf  relatively  distinct  methodological  perspectives  or  paradigms  [KuhnTO] 
in  Al  research.  Hall  and  Kibler  [HallSf],  have  proposed  the  following  taxonomy  of  methodological 
perspectives  m  AI:‘ 

•  Performance  Al  Oriented  towards  constructing  systems  with  impressive  levels  of  performance 

in  a  particular  domain. 

•  Constructive  Al  Oriented  towards  learning  general  principles  of  intelligence  by  designing, 

constructing,  and  experimenting  w-ith  systems  operating  in  complex  domains. 

•  Formal  Al  Oriented  towards  formal  specification  of  general  problems  and  mathematical 

proofs  of  the  correctness  and  computational  complexity  of  algorithms  for 
solving  these  problems. 


•  Speculative  Al  Oriented  towards  developing  a  theory  of  human  cognition  by  introspection  and 

construction  of  Al  systems.  Interested  in  global,  ecologically  valid  tasks  w'hich 
do  not  allow  for  detailed  empirical  validation. 

•  Empirical  Al  Oriented  towards  developing  a  theory  of  human  cognition  by  constructing  Al 

systems  that  closely  model  empirical  data.  Interested  in  well-defined  problems 
which  allow  for  detailed  empirical  validation. 

The  research  presented  in  this  thesis  does  not  strictly  adhere  to  any  one  of  these  perspectives  and 
cuts  across  the  ooundaries  of  many  of  them. 

Since  it  has  involved  constructing  and  experimenting  with  a  large  software  system,  the 
research  presented  here  can  be  seen  as  adhering  to  the  constructive  approach.  In  this  vein,  the  thesis 
attempts  to  provide  a  detailed  description  of  the  system  and  presents  numerous  concrete  examples 
of  its  operation  anc  empirical  data  on  its  performance. 

However,  this  thesis  aiso  presents  precise  descriptions  of  the  underlying  algorithms  used  in 

the  system  and  in  several  cases  mathematically  analyzes  the  computational  time  complexity  of 

these  algorithms.  Consequently,  it  can  also  be  seen  as  exhibiting  aspects  of  the  formal  approach. 

‘This  .s  no:  near.:  to  in.piy  that  Hall  and  Kioier  eiaim  that  all  research  in  Al  car.  ?e  pigeonholed  i.rio  one  of  t.nese 
caiegcr-.es.  Ir.  lac:,  they  explicitly  acinou’iecge  tre  "...tendency  on  T.e  par*,  o:  -esearcners  anc  issues  *.o  shill  quite  iree.y 
oe"ieer.  perspectives..."  p.  1  "3  . 
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Also  characteristic  of  the  formal  approach  is  the  use  of  predicate  caiculus  as  an  underlying 
representation  language. 

This  research  also  maintains  an  interest  in  human  cognition.  The  development  of  the  system 
was  motivated  and  guided  by  intuitions  about  human  learning  Characteristic  of  the  speculative 
approach,  the  task  of  narrative  understanding  is  used  as  the  primary  domain  of  application. 
Narrative  understanding  is  a  more  global  [LehnertS4]  or  ecologically  valid  task  than  well-defined 
tasks  such  as  the  eight-puzzle. 

However,  the  psychological  plausibility  of  the  system  is  not  supported  only  by  intuition  and 
anecdotal  evidence.  Empirical  evidence  from  psychological  experiments  is  used  to  support  the 
claim  that  the  system  exhibits  behavior  similar  to  that  found  in  human  subjects.  Therefore,  this 
research  also  has  features  of  the  empirical  approach. 

In  summary,  this  thesis  attempts  to  illustrate  that  a  uniform  and  coherent  piece  of  AI 
research  can  productively  employ  aspects  of  many  of  the  current  methodological  perspectives  in 
artificial  intelligence.  It  is  the  author's  sincere  hope  that  more  researchers  in  AI  will  learn  to 
appreciate  the  advantages  of  the  different  approaches  instead  of  slavishly  adhering  to  a  particular 
methodological  mind-set. 


'*  A  Cl  -V..V  .V  .VAA.V.V  .V  d  ■*>. 


TABLE  OF  CONTENTS 


*  w 


I 

( 

■ 


V 


6 

r. 

5? 

yv 

•i- 


CHAPTER 

1  INTRODUCTION  . 

1.1.  EGGS:  A  General  Explanation-Based  Learning  Mechanism  . 

1.2.  GENESIS:  Acquiring  Schemata  for  Narrative  Understanding  . 

1.3.  Historical  Comments  . 

1.4.  Organization  of  the  Thesis  . 

1.5.  Comments  on  Notation.  Implementation,  and  Figures  . 

2  SAMPLE  GENESIS  PERFORMANCE . 

3  EGGS:  A  DOMAIN  INDEPENDENT  EXPLANATION  GENERALIZER  . 

3.1.  Explanations.  Explanation  Structures,  and  Generalized  Explanations  . 

3.2.  Explanation  Generalizing  Algorithms  . 

3.3.  Comparison  of  Explanation  Generalizing  Algorithms  . 

3.4.  Computational  Complexity  of  Explanation  Generalization  . 

3.5.  Correctness  of  Explanation  Generalizing  Algorithms . 

3.6.  Pruning  Explanations  for  Operationalitv  . 

3.7.  Integrating  Explanation  Construction.  Pruning,  and  Generalization . 

3.S.  Explanation  Generalization  Versus  Chunking  and  Production  Composition 

• 

4  EGGS:  LOGICAL  PROOF  EXPLANATIONS . 

4.1.  Facilities  in  EGGS  Supporting  Logical  Proof  Explanations . 

4.2.  Logical  Proof  Examples  in  Several  Domains  . 

5  EGGS:  REWRITE  EXPLANATIONS  . 

5.1.  Rewritings  as  Explanations:  The  LEAP  Example . 

5.2.  Facilities  in  EGGS  Supporting  Rewrite  Explanations  . 

5.3.  Rewrite  Examples  . 

6  EGGS:  PLAN  EXPLANATIONS  . 

6.1.  Plans  as  Explanations:  The  STRIPS  Example  . 

6.2.  Facilities  in  EGGS  Supporting  Plan  Explanations  . 

6.3.  Plan  Examples . 


•  EGGS:  EMPIRICAL  RESULTS  ON  THE  EFFECT  OF  LEARNING 

'.1.  Performance  Improvement  . 

"  2.  Performance  Degradation  . 

"  3.  A  Suggestion  for  Better  Controlled  Exrer.ments . . 

S  GENESIS.  SYSTEM  OVERVIEW  . 


AAA 


CHAPTER 


5.1.  The  Parser  and  Lexicon . 

5.2.  The  Understander.  Schema  Library,  and  Causal  Model  . 

5.3.  The  Question- Answerer . 

5.4.  The  Paraphraser  . 

5.5.  The  Generator  and  V'ocabularv  . 

5.6.  The  Schema  Learner . 

5.7.  The  Word  Learner  . . 

S.S.  The  Indexer  Retriever  and  the  Long  Term  Store  . 

S.9.  The  Importance  of  the  Complete  System  . 

9  GENESIS:  SCHEMA  REPRESENTATION  . 

9.1.  Objects . 

9.2.  Attributes  and  States . 

9.3.  Actions  . 

10  GENESIS:  NARRATIVE  UNDERSTANDING . 

10.1.  Schema  Selection  . 

10.2.  Causally  Connecting  Actions  Without  a  Schema . 

10.3.  Limiting  Search  in  Understanding . 

1 1  GENESIS:  SCHEMA  ACQUISITION  . 

11.1.  Deciding  When  to  Learn  . 

11.2.  Pruning  the  Explanation  . 

11.3.  Generalizing  the  Explanation  . . 

11.4.  Packaging  the  Explanation . 

11.5.  Using  the  Learned  Schema  . 

12  GENESIS:  LEARNING  WORD  MEANINGS  . 

12.1.  A  Problem  with  Previous  Models  of  Learning  Word  Meanings . 

12.2.  Learning  Role  Labels  . 

12.3.  Learning  Schema  Labels  . 

12.4.  Comments  on  the  Integrated  Learning  of  Words  and  Concepts  . 

13  GENESIS:  LEARNED  SCHEMATA  AS  RETRIEVAL  INDICES  . 

14  THE  PSYCHOLOGICAL  STATUS  OF  EXPLANATION-BASED  LEARNING 

14  1.  Overview  of  the  Experiments . 

14.2.  Experimentl:  Abstract  Description  Generation  . 

14.3.  Experiment  2:  New  Instance  Generation  . 

14.4  Experiment  3:  Yes.  No  Questions  . 

14.5.  Discussion  of  the  Experimental  Results  . 

15  CONCLUSIONS  AND  FUTURE  WORK  . 

15.1.  Relation  to  Other  Work  . 


X 


CHAPTER 


15.2.  Problems  for  Future  Research 

15.3.  Conclusions  . 


APPENDIX  A  LINEAR  SUBSTITUTION  APPLICATION 


APPENDIX  B  ADDITIONAL  GENESIS  EXAMPLES 

B.l  The  Kidnap  Example  . 

B.2.  The  Arson  Example  . 

B.3.  The  Solicit  Example . 


APPENDIX  C  INITIAL  GENESIS  KNOWLEDGE 


APPENDIX  D  DETAILED  GENESIS  TRACE  FOR  THE  MURDER  EXAMPLE 


REFERENCES 


VITA 


ms&s&ssgm 


t 


v. 

tz 


1 


B 


R 

| 

'  \ 


Ai 


u 


CHAPTER  1 
INTRODUCTION 


Over  the  past  several  years,  research  in  machine  learning  has  included  an  ever  increasing 
number  of  projects  in  the  area  of  explanation-based  learning  (EBL)  [DeJongS6b.  Mitchell86].  Previ¬ 
ously.  research  in  machine  learning  was  predominantly  concerned  with  what  have  been  called 
similarity-based  [LebowitzS6j  or  empirical  [LanglevS6]  learning  methods.  Prototvpically.  a 
similarity-based  learning  (SBL)  method  involves  determining  a  concept  definition  by  examining  a 
large  number  of  examples  and  counter-examples  of  a  concept  and  searching  for  a  simple  description 
that  includes  all  of  the  examples  while  excluding  all  of  the  counter-examples  (see  [DietterichS2. 
DietterichSu.  Mitcbell82]  for  overviews  of  such  methods).  A  critique  of  these  methods  as  well  as 
numerous  replies  to  this  critique  can  be  found  in  [Schank86b].  In  contrast  to  this  approach,  an 
explanation-based  method  uses  its  existing  knowledge  of  the  domain  to  explain  why  a  single  exam¬ 
ple  is  a  member  of  a  concept  and  then  analytically  generalizes  this  explanation  to  determine  a 
description  of  the  general  concept.  A  discussion  of  the  advantages  and  disadvantages  of  this 
approach  can  be  found  in  [Mitchell86]  and  a  review  of  work  in  EBL  is  presented  in  [ElImanST], 
Some  interesting  remarks  on  the  relation  between  SBL  and  EBL  are  also  presented  in  [Dietterich86], 

This  thesis  presents  a  general  domain-independent  explanation-based  learning  mechanism  that 
has  been  tested  on  numerous  examples  from  various  domains.  In  addition,  it  discusses  how  this 
learning  system  has  been  integrated  with  a  narrative  understanding  mechanism  to  form  a  complete 
system  that  learns  new  plan  schemata  from  English  narratives  and  uses  these  schemata  to  improve 
its  understanding  ability.  Some  recent  psychological  experiments  that  indicate  people  can  learn  a 
schema  by  explaining  and  generalizing  a  single  instance  presented  in  a  narrative  are  also  briefly- 
reviewed.  Therefore,  an  overall  claim  of  the  entire  thesis  is  that  a  general  mechanism  for 
explanation-based  learning  can  be  used  to  improve  the  performance  of  a  narrative  understanding 
system  in  a  psychologically  plausible  manner. 

1.1.  EGGS:  A  General  Explanation-Based  Learning  Mechanism 

EGGS  is  a  general  E3L  system  that  has  been  tested  on  a  variety  of  examples  from  the  litera¬ 
ture  on  explanation-based  learning.  Many  of  these  examples  were  originally  used  to  demonstrate 
earlier,  more  domain-dependent  systems  such  as  STRIPS  [Fikes72],  LEX2  [VI itchel  18 3 ] .  CUPS 
[ W;nstonS3].  MA  [0'RorkeS4j.  and  LEAP  [Mitchell85],  The  examples  from  these  systems  come 
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from  a  number  of  different  domains  including  "blocks  world"  planning,  integration  problem  solv¬ 
ing,  learning  artifact  descriptions,  proving  theorems  in  logic,  and  designing  logic  circuits.  Neverthe¬ 
less.  the  same  learning  system  is  used  to  generalize  explanations  for  aii  of  these  examples.  EGGS 
also  includes  performance  systems  that  produce  explanations  for  the  learning  system  and  in  turn 
use  the  rules  it  generates  to  improve  their  ability  to  solve  future  problems. 

EBG  [MitchellSb]  is  an  alternative  domain-independent  technique  for  generalizing  explana¬ 
tions  that  was  independently  developed  at  the  same  time  EGGS  was  originally  designed  and  imple¬ 
mented.  The  eventual  implementation  of  EBG  [Kedar-CabelliS/a],  has  also  been  tested  on  several 
of  the  domains  mentioned  above.  All  of  the  original  systems  listed  above  used  similar  generaliza¬ 
tion  techniques:  however,  until  the  development  of  EGGS  and  EBG.  there  was  not  a  general  learn¬ 
ing  technique  that  could  be  used  to  generalize  examples  in  all  of  these  domains.  Explanation  gen¬ 
eralization  is  also  a  very  computationally  efficient  process.  As  demonstrated  in  this  thesis,  explana¬ 
tion  generalization  can  theoretically  be  performed  in  time  linear  in  the  size  of  the  explanation. 

EBL  requires  knowledge  of  the  underlying  theory  of  a  domain  before  it  can  learn  more  useful 
knowledge  for  efficiently  solving  problems  in  that  domain.  Consequently,  a  general  EBL  mechan¬ 
ism  that  can  learn  in  a  variety  of  domains  requires  what  has  been  called  an  exchangeable  knowledge 
module  [Michalski56].  In  EGGS,  this  is  accomplished  by  having  both  the  generalizer  and  perfor¬ 
mance  systems  make  use  of  a  library  of  declarative  knowledge  that  is  uniformly  represented. 
Currently.  EGGS  can  rewrite  rules,  or  STRIPS  operators.  This  set  of  representational  choices  has 
allowed  the  efficient  implementation  of  a  numerous  examples  from  various  domains. 

Finally,  previous  EBL  systems  that  learn  plans  or  macro-operators  have  generally  been  used 
to  learn  linear  sequences  of  actions.  EGGS  is  unique  in  that  it  contains  a  mechanism  for  learning 
plans  for  partially-ordered  sequences  of  actions.  This  process  results  in  more  general  plans  which 
are  applicable  to  a  wider  range  of  future  situations. 

1.2.  GENESIS:  Acquiring  Schemata  for  Narrative  Understanding 

GENESIS  is  a  narrative  understanding  system  that  improves  its  performance  by  using  the 
EGGS  learning  system  to  acquire  an  abstract  schema  from  a  specific  instance  of  a  novel  plan 
presented  in  an  English  narrative.  As  such,  it  tests  the  capacity  of  the  EGGS  learning  system  to 
improve  the  processing  ability  of  a  complete  performance  system  operating  in  a  complex  domain. 

Narrative  text  understanding  was  chosen  as  a  performance  domain  for  the  follow  ing  reasons. 
The  ability  to  "understand"  natural  language  text  is  a  very  difficult  task  requiring  a  large  amount 
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of  world  knowledge.  Systems  for  understanding  natural  language  text  (e.g.  [Cullingford?&. 
DeJongSZb.  DyerS3])  generally  encode  relevant  world  knowledge  m  terms  of  scripts  or  schemata 
[Schank77],  The  amount  of  world  knowledge  represented  in  terms  of  schemata  largely  determines 
the  performance  of  such  a  system.  Experience  with  the  FRUMP  system  [DeJongS2b]  indicated  that 
robustness  of  a  text  understanding  system  is  directly  related  to  the  number  of  schemata  it 
possesses.  However,  anticipating  and  encoding  all  of  the  schemata  required  for  a  robust  natural 
language  system  is  impossible  for  both  theoretical  and  practical  reasons.  Theoretically,  texts  can 
display  novel  concepts  unknown  to  the  implementors  of  a  natural  language  system.  If  the  natural 
language  system  is  to  respond  properly,  it  must  discover  such  new  concepts  automatically.  Practi¬ 
cally.  the  number  of  schemata  required  to  cover  most  natural  language  domains  is  prohibitively 
large  and  prevents  manual  programming  of  all  of  the  necessary  concepts.  Once  again,  automatic 
schema  acquisition  is  essential. 

Of  course,  narrative  understanding  is  not  unique  in  this  regard.  Most  realistic  performance 
tasks  require  large  amounts  of  domain  knowledge  represented  in  a  form  that  can  be  efficiently  used 
to  solve  problems:  and  constructing  such  a  knowledge-base  is  known  to  be  a  difficult  task.  In  fact, 
most  work  in  machine  learning  is  at  least  indirectly  addressing  the  problem  of  the  "knowledge 
acquisition  bottleneck”  [Feigenbaum83]  in  the  construction  of  knowledge-based  systems.  However, 
narrative  understanding  has  the  advantage  that  issues  can  be  illustrated  with  examples  that  are 
accessible  to  everyone,  not  just  to  individuals  who  have  advanced  degrees  in  medicine,  geology,  or 
computer  engineering. 

Another  interesting  aspect  of  this  domain  is  that  explanation-based  learning  has  generally- 
been  used  to  improve  the  performance  of  a  problem-solving  system  rather  than  an  understanding 
system.  GENESIS  is  unique  in  that  it  demonstrates  that  EBL  can  be  used  to  improve  a  system's 
ability  to  explain  observed  behavior  as  well  as  its  ability  to  solve  problems  efficiently. 
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1.3.  Historical  Comments 

An  original  implementation  of  GENESIS  was  reported  in  [MooneyS5b]  (shorter  versions 
include  [MooneySfla]  and  [MoonevS6a]).  This  version  employed  a  relatively  ad  hoc  explanation 
generalizing  system  that  was  specific  to  the  representations  used  by  the  narrative  understanding 
system  (see  [MoonevS5b]  for  details  on  this  generalizer).  However,  the  original  GENESIS  general- 
izer  was  similar  to  the  Lifting  process  used  to  learn  generalized  plan  MACROPS  in  STRIPS  [Fikes7Z] 
and  was  also  influenced  by  the  TMS-based  generalizer  used  by  the  MA  system  [0'RorkeS7b]. 
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An  attempt  to  convert  the  original  GENESIS  generalizer  into  a  domain-independent  explana¬ 
tion  generalizer  lead  to  the  development  of  the  EGGS  system.  The  generalization  algorithm  under¬ 
lying  the  EGGS  system  was  first  described  in  [DeJongS6b]  and  a  comparison  of  EGGS.  EBG.  and 
STRIPS  as  well  as  examples  of  applying  EGGS  to  numerous  domains  was  first  presented  in 
[MooneyS6c].  After  the  development  of  EGGS  was  complete,  an  entirely  new  version  of  the 
GENESIS  system  was  built  on  top  of  this  general  explanation-based  learning  system.  This  version 
of  GENESIS  is  described  in  detail  in  this  thesis.1 

1.4.  Organization  of  the  Thesis 

The  thesis  begins  with  a  sample  performance  of  the  GENESIS  system  that  illustrates  the  sort 
of  behavior  exhibited  by  the  complete  system.  The  next  four  chapters  discuss  various  components 
of  the  EGGS  system.  Chapter  3  discusses  the  domain-independent  explanation  generalizer  that  is 
used  to  generalize  all  of  the  examples  presented  in  the  thesis.  It  also  includes  a  discussion  on  the 
relation  between  the  explanation  generalization  algorithm  used  in  EGGS  and  the  generalizing  algo¬ 
rithms  used  in  STRIPS.  EBG.  and  SOAR  [LairdS6b]  as  well  as  an  analysis  of  the  computational 
complexity  of  explanation  generalization.  Chapters  4-6  discuss  additional  sub-systems  in  EGGS  for 
building  explanations  based  on  various  representations  and  present  numerous  examples  of  using 
EGGS  to  learn  in  various  domains.  Chapter  7  presents  a  summary  of  empirical  results  on  the  effect 
of  learning  on  future  problem  solving  performance. 

The  next  six  chapters  discuss  additional  components  in  the  complete  GENESIS  system. 
Chapter  S  presents  the  architecture  of  the  complete  system  and  briefly  discusses  the  role  of  each  of 
its  components.  Chapter  9  discusses  the  knowledge  representation  used  in  GENESIS  and  Chapter 
10  describes  the  processes  used  in  understanding  narratives.  Chapter  11  discusses  how  GENESIS 
uses  EGGS  to  generalize  explanations  constructed  by  the  understanding  system  and  illustrates  how 
the  schemata  it  learns  improve  its  ability  to  understand  subsequent  narratives.  The  final  two 
chapters  about  GENESIS  discuss  its  ability  to  learn  meanings  for  schema-related  words  from  a  sin¬ 
gle  instance  of  their  use  (Chapter  12)  and  to  use  learned  schemata  to  index  and  retrieve  specific 
episodes  (Chapter  13). 


'For  several  reasons,  this  new  implementation  is  not  called  GENESIS  II.  One  reason  is  to  avoid  association  with  Gene 
Rodder.berry's  faded  TV  pilot  of  The  same  name  [Gerrold73].  A  second  reason  .s  to  avoid  the  temptation  to  make  is  many 
versions  o;  GENESIS  as  There  were  sequels  to  Fridcy  tht  13th.  The  latter  benavior  has  beer,  known  to  provoke  satirical  com¬ 
ments  from,  some  stone-throwing  members  of  the  machine  learning  community  Ce.g.  the  reference  to  AQ6J  m  GanglySI],. 
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Chapter  14  reviews  some  psychological  experiments  that  indicate  explanation-based  learning 
is  psychologically  plausible.  Specifically,  these  experiments  demonstrate  that  people,  like  GENESIS, 
can  learn  a  schema  from  a  single  specific  example  presented  in  a  narrative  provided  they  have  the 
knowledge  to  explain  the  actions  in  the  example. 

Chapter  15  summarizes  the  unique  features  of  this  research  compared  to  other  work  in  EBL 
and  machine  learning  in  general  and  discusses  some  problems  and  directions  for  future  research. 

Finally,  there  are  a  number  of  appendices  that  provide  additional  details.  Appendix  A 
presents  a  lemma  needed  for  the  complexity  analysis  of  explanation  generalization.  Appendix  B 
presents  additional  examples  of  GENESIS'  performance  which  even  the  casual  reader  may  find 
interesting  to  peruse.  Appendix  C  lists  all  of  GENESIS'  initial  knowledge  and  appendix  D  gives  a 
detailed  GENESIS  trace. 

1.5.  Comments  on  Notation,  Implementation,  and  Figures 

Throughout  the  thesis,  the  LISP  convention  of  using  a  leading  question  mark  (e.g.  ?x23)  is 
used  to  denote  predicate  calculus  variables  [Chamiak85].  In  general,  standard  predicate  calculus 
notation  (e.g.  P(?x.?y))  is  preferred  to  LISP  notation  (e.g.  (P  ?x  ?y));  however,  for  convenience,  the' 
latter  is  sometimes  used,  particularly  in  the  appendices.  All  implementations  are  written  in 
INTERLISP  [Teitelman83]  and  were  developed  and  run  on  a  XEROX  1108  (Dandelion)  with  3.5  MB 
ot  main  memory.  All  explanation  graphs  presented  in  the  thesis  (including  accompanying  node 
labelling  tables  when  needed)  were  generated  automatically  by  the  system  using  Interlisp-D's 
GRAPHER  package  [Xerox86]  and  automatically  converted  to  TROFF  PIC  [Kemighan79]  and  TBL 
[Lesk79]  descriptions.  Only  minor  hand-editing  W'as  performed  to  reposition  nodes  for  more  com¬ 
pact  lay-out  and  to  insert  special  characters  (e.g.  /.  A.  =).  When  tables  are  used  to  fully  label 
nodes  in  large  explanation  graphs,  the  order  of  the  nodes  follows  a  depth-first,  left-to-rigbt  traver¬ 
sal  of  the  graph. 


CHAPTER  2 


SAMPLE  GENESIS  PERFORMANCE 


The  purpose  of  this  chapter  is  to  provide  the  reader  with  an  intuitive  feel  for  the  overall 
operation  of  the  system.  Throughout  this  thesis,  it  is  assumed  that  the  reader,  like  an  EBL  system, 
can  learn  a  great  deal  by  analyzing  and  understanding  a  single  concrete  example. 

The  primary  goal  of  the  GENESIS  system  is  to  improve  its  ability  to  understand  natural 
language  narratives  by  learning  new  plan  schemata.  Such  schemata  are  acquired  by  explaining  and 
generalizing  a  specific  instance  of  a  plan  executed  by  a  character  in  a  narrative.  Characters  actions 
are  explained  in  terms  of  later  actions  that  they  enable  and  in  terms  of  ultimate  goals  that  they 
achieve.  When  the  system  detects  that  a  character  has  achieved  an  important  goal  by  combining 
actions  in  a  novel  and  unfamiliar  way.  it  generalizes  the  specific  explanation  for  how  the  goal  was 
achieved  into  a  general  plan  schema.  Generalization  is  performed  by  an  analytic  technique  (EGGS) 
which  removes  irrelevant  information  while  maintaining  the  validity  of  the  explanation.  The 
resulting  schema  is  then  retained  by  the  system  and  indexed  so  that  it  can  be  subsequently- 
retrieved  and  used  to  aid  in  the  understanding  of  future  narratives. 

A  standard  procedure  is  used  to  test  GENESIS'  ability  to  learn  a  particular  schema  from  a  sin¬ 
gle  instance.  This  procedure  illustrates  both  the  schema  learning  process  itself  as  well  as  the  ability 
of  the  learned  schema  to  improve  system  performance.  First,  the  system  is  given  a  test  narrative 
which  presents  a  sparse  description  of  an  instance  of  the  schema.  This  description  is  missing  one  or 
more  actions  that  are  crucial  to  the  overall  plan.  Consequently,  the  narrative  is  not  detailed  enough 
for  the  system  to  construct  a  causally  complete  explanation  for  characters'  actions  without  a 
schema  to  supply  missing  actions  and  inferences.  The  system  is  therefore  unable  to  answer  ques¬ 
tions  that  require  making  default  inferences  about  what  must  have  taken  place  and  it  is  unable  to 
produce  an  adequate  paraphrase  of  the  narrative.  Next,  the  system  is  given  a  Learning  narrative 
which  describes  in  detail  a  complete  instance  of  the  schema  and  which  contains  the  crucial  actions 
and  other  information  that  were  lacking  in  the  test  narrative.  Using  its  existing  knowledge. 
GENESIS  is  able  to  construct  a  causally  complete  explanation  for  this  narrative  and  can  therefore 
answer  questions  about  why  actors  performed  certain  actions  as  well  as  produce  an  adequate  para¬ 
phrase  of  the  text.  Since  the  narrative  presents  a  novel  plan  that  achieves  an  important  goal,  the 
system  generalizes  its  explanation  of  how  the  goal  was  achieved  into  a  new  plan  schema.  Finally. 
GENESIS  is  given  the  test  narrative  again.  This  time,  it  can  use  the  schema  it  has  just  learned  to  fill 


in  missing  information,  and  as  a  result,  it  can  nosx  answer  questions  that  it  was  previously  unable 
to  answer  as  well  as  produce  a  better  paraphrase  of  the  text. 

T.ne  remainder  of  this  chapter  is  dedicated  to  presenting  the  trace  produced  by  GENESIS  as  it 
learns  a  "kidnapping  for  ransom"  schema  [DeJong81.  DeJongSku.  DeJong86a.  DeJongS6b. 
MooneySSa.  MooneyS6a].  This  is  currently  the  most  complicated  example  upon  which  GENESIS 
has  been  tested.  A  complete  step-by-step  description  of  the  system's  performance  on  this  example 
is  not  included  in  this  thesis  since  it  would  necessarily  be  very  long  and  complicated  and  the 
important  processes  are  more  easily  illustrated  using  simpler  examples.  However,  more  detailed 
information  about  this  example  is  given  in  appendix  B  and  definitions  of  ail  of  GENESIS'  initial 
scnemata  are  given  in  appendix  C.  Chapters  9-11  include  a  complete  step-by-step  description  of 
how  GENESIS  learns  and  uses  a  "murder  for  inheritance"  schema  and  a  detailed  system  trace  of 
this  example  is  given  in  appendix  D.  Details  on  other  GENESIS  examples  such  as  "arson  for 
insurance"  and  “solicitation  entrapment"  are  also  given  in  appendix  B. 

Before  processing  the  narratives  used  in  the  kidnapping  example,  the  system  has  knowledge 
about  bargaining,  capturing  and  confining,  threatening,  and  many  other  concepts:  however,  it  does 
not  have  any  knowledge  about  the  concept  of  kidnapping  for  ransom  nor  any  knowledge  of  the 
words  "kidnap.”  "kidnapper."  or  “ransom."  In  order  to  illustrate  the  system's  current  ignorance 
of  kidnapping  and  how  this  lack  of  knowledge  affects  its  ability  to  understand  certain  kidnapping 
narratives,  the  system  is  first  given  a  test  narrative.  Below  is  the  actual  I/O  trace  produced  by  the 
system. 

Input:  Ted  is  Alice's  husband.  He  won  100000  dollars  in  the  lottery.  Bob  imprisoned  Alice  in 
his  basement.  Bob  got  75000  dollars  and  released  Alice. 

Thematic  goal  achieved:  Ted  is  happy  that  Ted  has  the  S100000. 


Readv  for  cuestions: 


>  Summarize 

Ted  won  SlOOOOO  in  a  lottery.  Bob  captured  Alice  and  locked  her  in  a  basement.  Bob  releasee 
Alice. 


>  Whv  did  Bco  lock  Alice  m  his  basement? 


Cannot  find  sufficient  reason. 


>How  did  Bob  get  the  money? 

Cannot  find  sufficient  cause. 

>\Vhv  did  Bob  release  Alice 
Cannot  find  sufficient  reason. 

>  How  did  Ted  get  the  money? 

Ted  won  the  SIOOOOO  in  the  lottery. 

Notice  that  this  narrative  does  not  mention  how  Bob  got  the  money  or  why  Bob  imprisoned  Aiice 
and  then  released  her.  Since  the  system  does  not  have  a  schema  for  kidnapping,  it  cannot  infer  the 
missing  information  and  causally  connect  the  actions  in  the  input.  Consequently,  its  paraphrase  of 
the  narrative  is  very  poor  and  it  is  unable  to  answer  many  questions  about  it.  Next.  GENESIS  is 
given  the  following  more  detailed  narrative: 

Input:  Fred  is  Mary's  father  and  is  a  millionaire.  John  approached  Mary  and  pointed  a  gun  at 
her.  She  was  wearing  blue  jeans.  He  told  her  if  she  did  not  get  in  his  car  then  he  would  shoot 
her.  He  drove  her  to  his  hotel  and  locked  her  in  his  room.  John  called  Fred  and  told  him  John 
was  holding  Mary  captive.  John  told  Fred  if  Fred  gave  him  250000  dollars  at  Trenos  then 
John  would  release  Mary.  Fred  paid  him  the  ransom  and  the  kidnapper  released  Mary. 
Vaierie  is  Fred's  wife  and  he  told  her  that  someone  had  kidnapped  Mary. 

Thematic  goal  achieved:  John  is  happy  that  John  has  the  S250000. 

Explanation  suitable  for  generalization.  Pruning. ..Generalizing. ..Packaging... 

Creating  New  Schema:  (CaptureBargain  nx55  °a34  7b9  ?c4  ?r5  ?v5  ?lll) 

'1b9  is  a  person.  ’’cJ  is  a  location.  ?r5  is  a  room.  ?c4  is  in  ?r 5.  7x55  is  a  character.  ?b9  is  free. 
nx55  captures  ',b9  and  locks  him/her  in  ?r5.  '1a34  is  a  character.  ?x5 5  contacts  1*334  and  tells  it 
that  '"’bQ  is  ')x55  s  captive,  1^5  is  a  valuable.  7x55  wants  to  have  ?y5  more  than  it  wants  °b9 
to  be  °x55's  captive.  ‘)a34  has  a  positive  relationship  with  ?b9.  ?a34  has  ~!y 5.  ''xSS  and  7a34 
carry  out  a  bargain  in  which  '*x55  releases  ?b9  and  '’aSh  gives  '7x55  ?y5  at  '’ll  1. 


V 

c- 


9 


Unknown  word  ‘ransom'  refers  to  7v5  in  CaptureBargain 
Unknown  word  'kidnapper'  refers  to  ?x55  in  CaptureBargain 

Having:  ?x55  captured  ?b9  suggest  CaptureBargain 

Having:  ?x55  contacted  7a34  and  told  it  that  ?b9  was  ?x55‘s  captive,  suggest  CaptureBargain 
Having:  7x55  and  ?a34  carried  out  a  bargain  in  which  ?x55  released  ?b9  and  ?a34  gave  ?x5 5  ?y 5 
at  '’HI.  suggest  CaptureBargain 

Lr.k.nown  word  '7x55  kidnap  ?b9'  refers  to  CaptureBargain 
Ready  for  questions: 

>  Summarize 

Mary  was  free.  John  captured  Mary  and  locked  her  in  a  room.  John  contacted  Fred  and  told 
him  that  Mary  was  John's  captive.  John  wanted  to  have  S250000  more  than  he  wanted  Mary 
to  be  John's  captive.  Fred  had  a  positive  relationship  with  Mary.  Fred  had  the  S2.5(XXX).  John 
and  Fred  carried  out  a  bargain  in  which  John  released  Mary  and  Fred  gave  John  the  $250000  at 
Trenos. 

>  ’«Vhy  did  John  approach  Mary7 
So  John  could  aim  the  gun  at  Mary. 

>  Why  did  John  aim  the  gun  at  Mary? 

So  John  could  threaten  to  shoot  Mary  with  the  gun  unless  Mary  went  from  John  into  the  car. 

>  Why  did  Mary  get  into  the  car? 

Because  Mary  wanted  not  to  be  shot  and  because  Mary  believed  that  if  Mary  did  not  go  from 
John  into  the  car  then  John  would  shoot  Mary  with  the  gun. 

>  Why  did  John  threaten  to  shoot  Mary  unless  she  got  in  the  car? 

So  John  could  drive  Mary  to  the  hotel  in  the  car. 

>  Whv  did  John  drive  Marv  to  the  hotel'’ 


So  John  could  move  Mary  from  the  hotel  into  the  room. 

>  Why  did  John  move  Mary  into  the  room? 

So  John  could  confine  Mary  in  the  room. 

>  Why  did  John  confine  Mary  in  the  room7 

So  John  could  call  Fred  and  could  tell  him  that  Mary  was  John's  captive  and  so  John  and  Fred 
could  carry  out  a  bargain  in  which  John  released  Mary  and  Fred  gave  John  the  S250000  at 
Trenos. 

>  Why  did  John  tell  Fred  that  John  was  holding  Mary  captive? 

So  John  and  Fred  could  carry  out  a  bargain  in  which  John  released  Mary  and  Fred  gave  John 
the  $250000  at  Trenos. 

>  Why  did  Fred  give  John  the  money? 

Because  Fred  believed  that  if  Fred  gave  John  the  S250000  at  Trenos  then  John  would  release 
Mary  and  because  Fred  wanted  Mary  to  be  free  more  than  he  wanted  to  have  the  S250000. 

>  Why  did  Fred  have  $250000? 

Because  Fred  had  millions  of  dollars. 

>  Why  did  Fred  believe  that  John  was  holding  Mary  captive? 

Because  John  called  Fred  and  told  him  that  Mary  was  John's  captive. 

>  Why  did  Fred  want  Mary  free7 
Because  Fred  was  Mary's  father. 

>Why  did  John  release  Mary7 

Because  John  wanted  to  have  the  $250000  more  than  he  wanted  Mary  to  be  John's  captive  and 
because  John  beiieved  that  if  John  released  Mary  then  Fred  would  give  John  the  $250000  at 
Trenos. 
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L’r.iike  tne  first  narrative.  this  one  is  detailed  enough  to  a  1  Ionx  GENESIS  to  causally  connect  the 
characters'  actions  Specifically,  the  crucial  bargain  that  took  place  between  the  kidnapper  and  the 
ransom  payer  is  alluded  to  more  directly.  As  a  result,  the  system  is  able  to  answer  numerous  ques¬ 
tions  about  why  certain  characters  performed  certain  actions.  The  understanding  mechanism  is  dis¬ 
cussed  in  detail  in  chapter  10.  and  the  specifics  of  understanding  this  particular  narrative  are  given 
in  appendix  B.  The  resulting  explanation  for  how  John  got  the  S250.000  is  generalized  into  a  new 
schema  for  kidnapping  for  ransom  (which  GENESIS  calls  CaptureBargain  based  on  the  names  of 
two  existing  schemata  that  compose  the  new  schema).  Chapter  11  discusses  the  schema  acquisition 
process  in  detail. 

A  few  important  aspects  of  the  learned  schema  should  be  pointed  out.  First,  it  does  not  con¬ 
tain  any  facts  or  actions  that  are  irrelevant  to  the  workability  of  the  plan.  For  example,  the  fact 
that  the  victim  is  wearing  blue  jeans  is  not  included.  A  similarity-based  learning  system  that  was 
g:'- en  a  number  of  examples  of  kidnapping  in  which  the  victim  was  always  wearing  blue  jeans  is 
liable  to  include  this  fact  in  its  representation  of  the  concept.  Second,  the  exact  manner  in  which 
the  component  pian  schemata  (e.g.  Capture)  were  decomposed  into  subgoals  and  executed  in  the 
example  is  not  included  in  the  schema.  For  example,  the  fact  that  the  kidnapper  executed  the  Cap¬ 
ture  by  threatening  the  victim  with  a  gun  and  then  driving  her  to  a  hotel  or  that  he  contacted  the 
ransom  payer  by  telephone  are  not  a  par:  of  the  CaptureBargain  schema.  Third,  the  generalization 
process  eliminates  facts  that  are  causally  relevant  to  the  plan  only  because  they  are  specializations 
of  more  general  facts.  For  example,  the  fact  that  the  ransom  paver  was  the  victim's  father  is 
relevant  to  the  plan  since  it  motivated  him  to  pay  the  ransom:  however,  the  more  important  fact  is 
the  more  general  one  that  he  had  a  positive  emotional  relationship  with  the  victim  and  conse¬ 
quently  vaiued  her  freedom  more  than  personal  possessions.  This  generalization  is  important  since 
the  schema  should  be  able  to  handle  alternative  relationships  between  the  ransom  payer  and  victim 
such  as  the  husband-wife  relationship  in  the  test  narrative. 

After  learning  CaptureBargain.  the  system  indexes  the  new  schema  so  that  if  it  subsequently 
encounters  a  narrative  in  which  someone  captures  someone  or  exchanges  someone's  freedom  for  a 
caluabie  item,  it  will  be  reminded  of  CaptureBargain  and  attempt  to  use  it  to  help  understand  the 
narrative.  Details  of  the  indexing  process  and  how  learned  schemata  can  aid  the  understanding  of 
future  narratives  are  also  given  in  chapter  11.  The  system  also  acquires  preliminary  definitions  for 
three  unknown  words  which  occurred  .r.  the  story:  kidnapper,  ransom,  and  kidnap.  GENESIS' 
ability  to  also  learn  word  meanings  from  a  single  example  is  discussed  in  chapter  12. 
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Finally.  GENESIS  is  given  the  test  narrative  once  again.  This  time.  Alice's  imprisonment  rem¬ 
inds  it  of  CaptureBargam  and  it  uses  this  newly  acquired  schema  to  infer  the  missing  actions  and 
explain  hew  Bob  got  the  money.  Consequently,  us  ability  to  answer  questions  and  paraphrase  the 
text  is  greatly  improved. 


Input:  Ted  is  Alice’s  husband.  He  won  100000  dollars  in  the  lottery.  Bob  imprisoned  Alice  in 
his  basement.  Bob  got  75000  dollars  and  released  Alice. 


Thematic  goal  achieved:  Ted  is  happy  that  Ted  has  the  S100000. 
Thematic  goal  achieved:  Bob  is  happy  that  Bob  has  the  S75000. 


Ready  for  questions: 


>  Summarize 

A.iice  was  free.  Bob  captured  Alice  and  locked  her  in  a  basement.  Bob  contacted  Ted  and  told 
him  that  Alice  was  Bob’s  captive.  Bob  wanted  to  have  S75000  more  than  he  wanted  Alice  to  be 
Bob's  captive.  Ted  had  a  positive  relationship  with  Alice.  Ted  had  the  S75000.  Bob  and  Ted 
carried  out  a  bargain  in  which  Bob  released  Alice  and  Ted  gave  Bob  the  S75000. 


>  Why  did  Bob  lock  Alice  in  his  basement'’ 

So  Bob  could  contact  Ted  and  could  teil  him  that  Alice  was  Bob’s  caotive  and  so  Bob  and  Ted 


could  carry  out  a  bargain  in  which  Bob  released  Alice  and  Ted  gave  Bob  the  S75000. 


>  Why  did  Bob  release  Alice'’ 

Because  Bob  wanted  to  have  the  S75000  more  than  he  wanted  Alice  to  be  Bob’s  captive  and 
because  Bob  believed  that  if  Bob  released  Alice  then  Ted  would  give  Bob  the  S75000. 


>  How  did  Bob  get  the  money'1 
Bob  kidnapped  Alice. 


>  Who  gave  Bob  the  money’’ 
Ted  gave  Boo  the  S75GOQ 
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>  Why  did  Ted  give  him  :he  money? 

Because  Ted  believed  that  if  Ted  gave  Bob  the  S75000  then  Bob  would  release  Alice  and 
oecause  Tec  wanted  Alice  to  be  free  more  than  he  wanted  to  have  the  $75000 


>  Why  did  Ted  want  Alice  free0 
Because  Ted  was  Alice's  husband. 

>  Why  did  Ted  believe  that  Bob  was  holding  Alice  captive? 

Because  Bob  contacted  Ted  and  toid  him  that  Alice  was  Bob's  captive. 


Whiie  answering  questions  about  this  narrative  for  the  second  time,  it  is  interesting  to  note  that  the 
system  used  the  newly  learned  word  "kidnap"  to  refer  to  the  CaptureBargain  schema  in  its  answer 
to  the  question:  "How  did  Bob  get  the  money?"  This  is  because  the  state  in  question  is  an  effect  of 
the  new  schema  and  since  it  knows  how  to  refer  to  this  schema  in  English,  it  considers  it  to  be  an 
annrorriate  answer. 


CHAPTER  3 


EGGS:  A  DOMAIN'  INDEPENDENT  EXPLANATION  GENERALIZER 


EGGS  is  a  general  domain  independent  explanation-based  learning  system.  In  addition  to 
being  the  underlying  learning  mechanism  in  the  GENESIS  system,  it  has  been  tested  in  numerous 
other  domains  using  various  underlying  representations  such  as  Horn-clause  logic,  rewrite  rules, 
and  STRIPS  operators.  An  abstract  outline  of  the  learning  process  in  EGGS  is  given  in  Figure  3.1 
and  an  architectural  diagram  of  the  system  is  given  in  Figure  3.2. 

The  tasks  of  constructing  an  explanation  (step  1)  and  packaging  the  generalized  explanation 
for  future  use  (step  4)  both  depend  on  the  underlying  representational  formalism.  Each  representa¬ 
tional  formalism  requires  different  modules  for  these  tasks.  For  example,  when  using  Horn  clauses, 
a  theorem  prover  is  appropriate  for  constructing  explanations,  while  when  using  STRIPS  operators, 
a  planner  is  appropriate.  Each  of  the  following  three  chapters  is  dedicated  to  a  different  representa¬ 
tional  formalism  and  discusses  modules  within  EGGS  for  building  and  packaging  explanations 
using  that  representation.  Examples  of  using  each  representation  in  various  domain  are  also  given 
in  these  chapters. 

Uniike  explanation  construction  and  packaging,  explanation  generalization  (step  3).  can  be 
characterized  in  a  very  general  way  and  is  discussed  in  detail  in  this  chapter.  The  general  task  of 
pruning  the  structure  of  the  explanation  to  increase  generality  while  maintaining  operationally 
:step  2  !  is  also  characterized  in  this  chapter;  however,  specific  rules  for  pruning  are  domain  depen¬ 
dent  and  are  discussed  in  following  chapters. 


3.1.  Explanations,  Explanation  Structures,  and  Generalized  Explanations 

In  different  domains,  various  types  of  explanations  are  appropriate.  In  [MitchellS6  j.  an  expla¬ 
nation  :s  uehned  as  a  logical  proof  that  demonstrates  how  an  example  meets  a  set  of  sufficient  con¬ 
ditions  defining  a  particular  concept.  This  type  of  explanation  is  appropriate  for  learning  ciassica. 
-creep:  definitions,  such  as  learning  a  structural  specification  of  a  cup.  an  example  introduced  m 
•.'’•Vir.stor.S3;  and  discussed  in  [\litchell&6].  However,  when  learning  general  plans  in  a  problem 
solving  domain  As  in  STRIPS  [Fikes’2]  or  GENESIS  [Mooney85a]).  ;t  is  more  appropriate  to  con¬ 
sider  an  exriar.ation  to  be  a  set  of  causallv  connected  actions  that  demonstrate  how  a  goal  state  is 
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A  unit  is  a  connected  directed  acyciicai  graph  (V.  E)  in  which  the  vertices  in  V  are  wffs 


For  examrie.  a  unit  for  a  Horn-clause  rule  has  wffs  for  its  antecedents  and  its  conseouent.  while  a 


unit  for  a  STRIPS  operator  has  wffs  for  its  effects,  deletions,  and  preconditions.  A  wff  a  in  a  unit  is 
said  to  support  another  wff  b  in  the  unit  if  and  only  if  there  is  a  directed  path  from  c  to  b.  For 
example,  in  the  unit  for  a  Horn  clause,  each  antecedent  supports  the  consequent  through  a  path  con¬ 


taining  a  single  edge. 


A  domain,  theory.  T.  is  formally  dehned  as  a  set  of  units.  As  defined  in  [XilssonSO],  a  substitu¬ 
tion  is  a  set  of  ordered  pairs  each  specifying  a  term  to  be  substituted  for  a  particular  variable.  The 
expression  p9.  where  p  is  a  wff  and  9  is  a  substitution,  denotes  the  wff  resulting  from  applying  6  to 
p.  The  expression  yd.  where  both  y  and  9  are  substitutions,  denotes  the  substitution  resulting  from 
the  composition  of  y  and  9.  which  is  obtained  by  applying  9  to  the  terms  of  y  and  then  adding  any 
pairs  of  0  having  variables  not  occurring  among  the  variables  of  y.  An  instance  of  a  unit.  a.  is  a 
unit  obtained  by  applying  a  variable  substitution  to  all  of  the  wffs  in  a.  Two  wffs  are  said  to  be 
identical  if  and  only  if  all  of  their  corresponding  predicates,  functions,  variables,  and  constants  are 
exactly  the  same  (i.e.  their  most  general  unifier  is  the  null  substitution).  Before  formally  defining 
an  explanation  in  this  representation,  a  few  additional  definitions  are  needed. 


A  unit-set  is  a  cair  (U.  R)  where  L  is  a  set  of  units:  ((V,.  E.).  ...  (V  .  E  )!  and  R  is  an 
equivalence  relation  defined  on  the  set  of  wffs:  V  U  V,  ...  U  V..  For  each  pair  of  wffs  ( a  .b) 
in  R  where  a  €  V  and  o  €  V..  it  must  be  the  case  that  that  i  ^  i  (i.e.  equivalent  wffs  must  be 


from  seDarate  units). 


Given  a  unit-set  S  =  (U,  R)  where  U  =  !(V  E,).  ...  (V  ,  E  )}.  let  C,.  C,.  ...  C  be  the  equivaler.ee 

classes  of  wffs  defined  by  R.  Let  G  be  the  graph  (V'.  E’)  where  V'  =  {C,,  C,.  ...  Cm!  and  (C.  € 

E’  if  and  only  if  there  are  wffs  a  €  C,  and  b  6  C;  such  that  (a.  b)  €  Es  U  E,  ...  U  E^.  G  is  referred 
to  as  the  graph  of  S  and  represents  the  directed  graph  obtained  by  "collapsing"  all  equivalent  wffs 
into  a  single  vertex.  A  formal  definition  of  an  explanation  can  now  be  stated  as  follows. 


An  explanation  is  a  unit-set.  S  =  (U.  R).  where  the  graph  of  S  is  connected  and  acyclic  and 
where  for  each  pair  of  wffs  (a,  b)  €  R.  a  and  b  are  identical.  Furthermore,  let  the  set  L"  C  U 
be  the  set  of  all  units  in  U  that  are  instances  of  units  in  the  domain  theory.  T.  and  let  R’  be  the 
equivalence  relation  such  that  (a.  b)  €  R'  if  and  only  if  both  a  and  b  are  wffs  from  units  in  L" 
and  (a.  b)  €  R.  In  order  for  S  to  be  an  explanation.  U  must  be  nonempty  and  the  graph  of  the 


unit-set  S'  =  (L".  R’  1  must  also  be  connected  and  acvclic. 
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In  other  words,  an  explanation  is  a  combination  of  units  that  forms  an  even  larger  connected  acv- 
cl’.c  graph  by  means  of  an  an  equivalence  relation  defined  on  their  vertices  Each  pair  of  wffs  that 
the  relation  defines  as  equivalent  must  be  identical  Furthermore,  if  all  units  that  are  not  instances 
of  units  in  the  domain  theory  are  removed  from  an  explanation,  the  remaining  explanation  also 
defines  a  connected  directed  acyclic  graph.  The  goal  is  a  distinguished  wff  in  the  explanation  that  is 
a  sink  of  the  graph  of  the  explanation  and  represents  the  final  conclusion  in  an  inference  chain  or 
the  desired  state  in  a  plan. 

A  Horn-clause  proof  in  this  representation  is  an  explanation  whose  units  are  Horn  clauses  and 
whose  equivalence  relation  matches  antecedents  of  some  clauses  to  consequents  of  others.  In  this 
case,  an  explanation  is  analogous  to  the  data  dependency  structure  maintained  by  a  truth  mainte¬ 
nance  system  [Doyle79].  For  example,  consider  the  following  domain  theory  for  the  problem  of 
learning  a  structural  definition  of  a  cup.  an  example  originally  presented  in  [WinstonS3], 

Stable(°x)  A  Liftable(?x)  A  OpenVessel(?x)  —  CupOx) 

Bottomry)  A  PartOf(?v.?x)  A  Flat(?v)  —  Stable(?x) 

GraspableOx)  A  LightC’x)  —  Liftable(?x) 

Handle(?v)  A  Part0f(7v.?x)  —  Graspablef^x) 

Concavity(?y)  A  PartOf(?y.?x)  a.  UpwardPointmg(?vJ  —  OpenVessel(?x) 

Additional  units  needed  for  the  problem  are  the  following  individual  facts. 

Light(Objl).  Color(Objl.Red).  PartOf(Handlel.Objl).  Handle(Handlel).  Bottom(Bl). 

PartOf(Bl.Objl).  rlat(Bl).  Concavity(Cl).  PartOfCC! .Objl),  UpwardPointing(Cl) 

A  proof  tree  for  Cup(Objl)  is  shown  in  Figure  3.3  as  an  explanation  whose  goal  is  Cup(Objl).  Tri- 
✓ 

pie  edges  in  the  graphs  indicate  equivalences  between  wffs  in  two  units  that  are  instances  of  the 
domain  theory  while  double  edges  indicate  equivalences  to  wffs  in  units  that  are  not  instances  of 
units  in  the  domain  theory.  Specifically,  for  explanations  using  Horn  clauses  as  units,  triple  edges 
indicate  connections  between  instantiations  of  rules  from  the  domain  theory  while  double  edges 
indicate  connections  to  initial  facts  about  the  specific  example.  Examples  of  explanations  where  the 
units  include  STRIPS  operators  are  presented  in  chapter  6. 

A  wff  a  is  a  unionized  version  of  a  wff  b  if  and  only  if  a  is  obtained  by  substituting  a  uniquely 
named  variable  for  each  variable  in  b.  In  correspondence  with  the  terminology  in  [MitchellS6],  an 
explanation  structure  ;s  defined  as  an  explanation  with  each  instantiated  unit  from  the  domain 
theory  replaced  by  a  umquizec  ■. ersion  of  its  general  definition.  Formally: 


Graspable(Objl) 


Light(Objl) 


Handle(Hl) 

II 

Handle(Hl) 


PartOf(Hl.Objl) 

II 

PartOf(Hl.Objl) 


Figure  3.3:  Explanation  for  CupfObjl) 


An  explanation  structure  of  an  explanation  E  =  (U.  R)  is  a  unit-set.  S  =  (U  .  R')  where  for 
each  u.  €  U  where  ui  is  an  instance  of  a  unit  t(  €  T.  there  is  exactly  one  u '  €  L"  such  that  u  '  is 
a  uniquized  version  of  t.  and  where  (u u  ')  €  R'  if  and  onlv  if  (u  .  u  )  €  R. 

The  definition  of  an  explanation  insures  that  an  explanation  structure  defines  a  connected  directed 
acyclic  graph.  For  example,  the  explanation  structure  of  the  explanation  for  the  cup  example  is 
shown  in  Figure  3.4. 

The  task  of  explanation  generalization  is  to  take  an  explanation  and  generate  a  generalized 
explanation,  which  is  the  most  general  instance  of  its  explanation  structure  in  which  equivalent 
wffs  are  identical.  The  generalized  explanation  maintains  matches  between  wffs  from  rules  or  facts 
in  the  domain  theory  but  eliminates  matches  to  wffs  specifying  facts  of  the  particular  specific 
example.  This  means  that  the  most  general  substitution  that  results  in  all  equivalent  wffs  being 
identical  must  be  applied  to  the  explanation  structure.  Formally: 


S  N  -» 


(Cup  9xl) 


(Stable  9xl)  (Liftable  ?xl)  ( Open  Vessel  ?xl) 


III  III  III 

(Stable  9x2)  (Liftable  ?x3)  (Open Vessel  9x5) 


m 

(Graspable  ?x4) 


(Handle  ?y2)  (PartOf  ?y2  ?x4) 

Figure  3.4:  Explanation  Structure  for  the  Cup  Example 
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A  generalized  explanation  of  an  explanation  E  with  an  explanation  structure  S  =  (U.  R)  is  an 
explanation  G  =  (L“.  R')  such  that  there  exists  a  substitution  y  where  for  each  u;  €  L*  there  is 

exactly  one  u'  €  L“  such  that  U;’  =  uv  and  where  (u;'.  u,‘)  6  R'  if  and  only  if  (a.  u  )  €  R. 

Furthermore,  if  (u/.  u,")  €  R’.  then  u’  and  u(’  must  be  identical.  Finally,  for  any  other  substi¬ 
tution.  9.  satisfying  these  constraints,  there  must  exist  a  substitution  9'  such  that  L'0  =  L'y9' 
(this  insures  that  y  is  the  most  general  substitution  that  satisfies  the  constraints). 

The  generalized  explanation  of  the  cup  example  is  shown  in  Figure  3.6.  This  generalized  explana¬ 
tion  can  then  be  used  to  obtain  the  follow  ing  macro-rule  representing  a  general  structural  definition 
of  a  cup. 

BottomC’yl)  A  PartOf (?y lTxl )  A  Flat(9y  1 )  A  Hand!e^v2)  A  PartOf(?y2.?xl) 

A  LightPxl  j  Concavity(?y3)  A  PartOf(?v3.',xl )  'V  LpwardPointing(',y3)  —  CupC’xl) 

For  explanations  that  are  logical  proofs,  a  macro-rule  like  the  one  above  is  easily  obtained  by  tak¬ 
ing  the  leaves  of  the  generalized  explanation  as  the  antecedents  and  the  goal  of  the  generalized 

explanation  as  the  consequent.  In  planning  domains,  the  generalized  explanation  represents  a 


Stable(?xl) 


Liftabie(?xl) 


Stable(?xl) 


BottomCVl) 


Flat(?vl)  / 


Liftable(?x  1 )  Open 

,A  _ _ _ 

/  \  Concavitv(?v3) 


ODenVsssel(?xl ) 
III 

OpenVessel(?xl ) 


PartOf(?vl.?xl) 


7y3)  |  L'pwardPointing('1y3) 

PartOf(?v3.?xl) 


Graspable(?xl)  Light(?xl) 

III 

Graspable(?xl) 


Handle(?v2)  PanOf(?v2.?xl) 


Figure  3-5:  Generalized  Explanation  for  the  Cup  Example 


general  plan  schema  or  macro-operator  [Fikes72]  for  achieving  a  particular  ciass  of  goals.  Creating 
a  new  action  definition  for  the  composed  plan  from  the  generalized  explanation  requires  a  few  addi¬ 
tional  steps  which  are  discussed  in  section  6.2.2. 

3.2.  Explanation  Generalizing  Algorithms 

Several  algorithms  have  been  developed  for  generalizing  various  types  of  explanations.  The 
STRIPS  system  [Fikes72]  incorporated  a  method  for  generalizing  blocks-world  plans  into  macro¬ 
operators.  The  EBG  method  [Mi:chell86]  uses  a  modified  version  of  goal-regression  (\Valdmger7“] 
to  generalize  proofs  of  concept  membership.  The  EGGS  explanation  generalization  algorithm  was 
developed  for  generalizing  the  bread  ciass  of  explanations  defined  in  the  previous  section.  This  algo¬ 
rithm  was  first  published  in  [DeJongS6b]  along  with  a  description  of  an  error  found  in  the 
specification  of  the  EBG  algorithm.  Kedar-Cabelli  and  McCarty  subsequently  developed  a  PROLOG 
-ersior.  of  EBG  [Kedar-Cabelli57aj  which  corrected  this  problem  with  the  original  algorithm. 

The  general  technique  used  by  STRIPS.  EBG.  EGGS,  and  PROLOG-EBG  can  be  abstracted  to 
apply  :c  the  class  of  explanations  defined  in  the  previous  section.  The  rest  of  this  section  is  devoted 
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to  presenting  and  comparing  algorithmic  descriptions  of  all  of  these  methods  as  applied  to  this  class 
of  explanations.  All  of  the  algorithms  rely  on  unification  pattern  matching  and  the  abbreviation 
MGU  is  used  to  refer  to  the  substitution  that  is  the  most  general  unifier  of  two  wffs  [CharniakSO. 
XilssonSO].  All  of  the  generalization  algorithms  presented  have  been  implemented  and  tested 
within  the  context  of  the  overall  EGGS  system. 

3.2.1.  STRIPS  MACROP  Learning 

The  first  uork  on  generalizing  explanations  was  the  learning  of  robot  plans  in  STRIPS 
[Fikes72].  STRIPS  worked  in  a  "blocks  world"  domain.  After  its  problem  solving  component  gen¬ 
erated  a  plan  for  achieving  a  particular  state,  it  generalized  the  plan  into  a  problem  solving  schema 
(a  MACROP  or  macro-operator)  which  could  be  used  to  efficiently  solve  similar  problems  in  the 
future.  Work  on  the  STRIPS  system  was  the  first  to  point  out  that  a  correct  generalization  of  a 
connected  set  of  actions  or  inferences  can  not  be  obtained  by  simply  replacing  each  constant  by  an 
independent  variable.  This  method  happens  to  work  on  the  CUP  example  given  above.  The  proper 
generalized  explanation  can  be  obtained  by  replacing  Objl  by  Yd.  Bl  by  ?yl.  HI  by  7y2.  and  Cl  by 
°v3.  However,  in  general,  such  a  simplistic  approach  can  result  in  a  structure  that  is  either  more 
general  or  more  specific  than  what  is  actually  supported  by  the  system's  domain  knowledge. 

The  following  examples  are  given  in  [Fikes72]  to  illustrate  that  simply  replacing  constants 
with  variables  can  result  in  improper  generalizations.  These  exampies  assume  the  initial  state 
shewn  in  Figure  3.6  and  use  the  following  operators: 

GoThru(?d.?rl ,?r2):  Go  through  door  ?c  from  room  Yl  to  room  Y2. 

PushThru(?b.?d.?rl.?r2):  Push  box  ?b  through  door  ?d  from  room  ?rl  to  room  Y2. 
SpeciaiPushiTb):  Specific  operator  for  pushing  box  7b  from  Room2  to  Rooml. 

Given  the  plan: 

Go  Thru!  Door  I. Room  1  .Room2) 

Specia!Push(  Boxl ) 

simply  replacing  constants  by  variables  results  in  the  plan: 

GoThruOd.Yl.7r2) 

SpecialPushi',b) 

This  plan  :s  too  general  since  SpecialPusn  is  only  applicable  when  starting  in  Roorr.2.  so  having  a 
triable  Y2  as  the  destination  of  the  GoThru  is  too  general  and  Y2  should  be  replaced  by  Rcom2. 


GcThru(  Door  1  .Room  1  .Room2 ) 

PushTnruCBox  1  .Door  1  .Room2.Room  1 ) 

simply  replacing  constants  by  variables  results  in  the  plan: 

GoThru(?d.?rl.?r2) 

PushThru('7b.?d.'1r2.I’rl ) 

This  plan  is  too  specific  since  the  operators  themselves  do  not  demand  that  the  room  in  which  the 
robot  begins  (?rl )  be  the  same  room  into  which  the  box  is  pushed.  The  correct  generalization  is: 

GoThru('’d.'1rl  .'1r2) 

PushThru(',b.'>d.'>r2.?r3) 

The  exact  process  STRIPS  uses  to  avoid  these  problems  and  correctly  generalize  an  example  is 
dependent  on  its  particular  representations  (triangle  tabies)  and  inference  techniques  (resolution); 
however,  the  basic  technique  is  easily  captured  using  the  representations  discussed  in  section  3.1.  A 
description  of  the  basic  explanation  generalizing  algorithm  used  in  STRIPS  is  shown  m  Figure  3.7. 
It  should  be  noted  that  the  generalization  process  in  STRIPS  was  constructed  specifically  for  gen¬ 
era.  ;zmg  robot  plans.  There  was  no  attempt  to  present  a  general  learning  method  based  on  general- 
:.-:r.z  explanations  in  any  domain.  However,  the  algorithm  in  Figure  3.7  ;s  a  straight-forward  gen¬ 
eralization  of  the  basic  process  used  in  STRIPS.  The  basic  technique  is  to  unifv  eacn  pair  of 
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for  each  equality  between  wifs  x  and  y  in  the  explanation  structure  do 
let  0  be  the  \1GU  of  x  and  y 
for  each  wff  z  in  the  explanation  structure  do 
replace  z  with  z6 

Figure  3.7:  STRIPS  Explanation  Generalizing  Algorithm 


equivalent  wifs  in  the  explanation  structure  and  appiy  each  resulting  substitution  to  all  of  the  wifs 
in  the  explanation  structure.  After  all  of  the  unifications  and  substitutions  have  been  made,  the 
result  is  the  generalized  explanation  since  each  wff  has  been  replaced  by  the  most  general  wff  that 
allows  all  or  the  equality  matches  in  the  explanation  to  be  satisfied. 

3.2.2.  EBG 

In  [MitchellS6],  Mitchell.  Keller,  and  Kedar-Cabelli  describe  a  technique  called  EBG 
(Explanation-Based  Generalization)  for  generalizing  a  logical  proof  that  a  particular  example 
satisfies  the  definition  of  a  concept.  An  example  concept-membership  proof  showing  how  a  particu¬ 
lar  object  satisfies  the  functional  definition  of  a  cup  was  given  in  Figure  3.3.  Unlike  the  STRIPS 
MACROP  learning  method.  EBG  was  meant  to  be  a  general  method  for  learning  by  generalizing 
explanations  of  why  an  example  is  a  member  of  a  concept.  In  [MitchelI86],  detailed  examples  are 
presented  illustrating  how  EBG  can  be  applied  to  learning  an  operational  definition  for  when  it  is 
safe  to  stack  something  on  an  endtabie1.  to  Winston's  CUP  example  [Winston83].  and  to  an  exam¬ 
ple  from  LEX2's  domain  of  learning  heuristics  for  symbolic  integration  [MitchellS3].  A  much  more 
abstract  description  of  how  it  might  be  used  to  learn  a  kidnapping  plan  like  that  learned  by  the  ori¬ 
ginal  GENESIS  system  [Moonev85a]  is  presented  in  an  appendix. 

The  original  EBG  algorithm  presented  in  [Mitche!186]  is  based  on  goal  regression  [Wald- 
ingerTT]  and  involves  back-propagating  constraints  from  the  goal  through  the  explanation  struc¬ 
ture  to  the  leaves.  Figure  3.8  presents  a  formal  specification  of  the  original  algorithm  in  terms  of 
the  explanation  representation  introduced  earlier.  The  global  variable  R  maintains  the  current  set  of 
regressed  expressions  and  represents  the  most  general  set  of  antecedents  necessary  to  prove  the  goal 
given  the  portion  of  the  explanation  structure  already  traversed.  The  explanation  structure  is 
traversed  from  the  goal  back  to  the  leaves  in  a  depth  first  manner.  Each  time  a  unit  (rule)  is 
traversed,  the  set  R  is  undated  and  the  substitution  resulting  from  the  unit's  unification  to  tne 
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structure  already  traversed  is  applied  to  all  of  the  wffs  in  R.  After  the  entire  explanation  structure 
has  been  traversed.  R  is  the  most  general  set  of  antecedents  for  the  given  explanation  structure.1 

However,  as  initially  pointed  out  in  [DeJongS6b].  this  algorithm  is  only  guaranteed  to  deter¬ 
mine  the  leaves  of  the  generalized  explanation  and  in  certain  situations  fails  to  obtain  the  correct 
generalized  goal.  The  ‘Suicide''  example,  originally  introduced  in  [DeJong$6b]  is  an  example  for 
which  the  original  EBG  algorithm  does  not  compute  the  correct  generalized  goal  and  as  a  result 
learns  an  incorrect  macro-rule.  This  example  involves  inferring  that  an  individual  will  commit 
suicide  if  he  is  depressed  and  buys  a  gun.  The  specific  facts  of  the  problem  are: 


Depressed(John).  BuvCJohn.Objl ).  Isa(Objl.Gun) 


i  he  domain  rules  are: 


Deoressed(?x)  —  Hate(?x.?x) 


HateC’x.'V)  A  Possess(?x.°z)  A  Isa( ?z. Weapon)  -*  Kiil(?x.?y) 
Buy('1x.',y)  —  Possess(°x.?v).  IsaC’x.Gun)  —  Isa(?x. Weapon) 


‘T.re  algorithm,  presented  in  Figure  3.S  corrects  problems  with  the  BackPropagate  function  presented  :n  [Mooney 'db. 
Mcor.ey«6cj.  As  discussed  ir.  [Mooneyabbj.  the  BackPropagate  function  (which  was  based  or.  the  informal  description  of  this 
process  giver.  :n  [Mitc.neII86]j  does  not  properly  propagate  constraints  across  conjuncts  and  consequently  ,n  some  situations 
toes  rot  compute  the  correct  regressed  expressions.  The  version  m  Figure  3.S  does  not  have  this  problem  since  each  subst-tu- 
t.or,  s  aop.ied  -o  all  of  the  current  regressed  expressions  in  the  set  R. 
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The  proof  ’hat  John  will  commit  suicide  is  shown  in  Figure  3  9.  its  explanation  structure  is  shown 
in  Figure  3.10.  and  the  correct  general  proof  that  anyone  who  is  depressed  and  buys  a  gun  will 
commit  suicide  is  shown  in  Figure  3.11.  The  general  macro-rule  learned  from  the  generalized 
explanation  is: 

Depressed(?vl)  A  Buy(?vl.?cl)  A  Isa(?cl.Gun)  —  Kill(?yl.?yl) 


Kill(John.John) 


Isa(Objl, Weapon) 


IsaCObjl.  Weapon) 

f 

i 

Isa(Objl.Gun) 

!! 

il 

Isa(Objl.Gun) 


PossessCJohn.Objl ) 

II 

I  I 

Possess!  John.Objl ) 
A 

Buv(John.Objl) 


Buv(John.Objl) 


Hate!  Jonn.John) 

ii 

I 

il 

Hate(John.John) 

A 

I 

Depressed!  John) 

j 

i 

Deoressed!  John) 


Figure  3.9:  Suicide  Example  —  Specific  Explanation 


Kill(?xl.?vl) 


Isa!°cl  .WeaDon) 

II! 

Isa(',x4.W  eapon) 
A 


Possess!  ?xl. I’d) 


Possess!  ?x  3. ?y  2) 
A 


Hate!?xl.?vl) 


Hate(?x2.',x2) 

A 


lsa(',.\4.Gun) 


Buv(?x3.?v2) 


Derressed!?x2) 


Figure  3.10:  Su-cide  Example  —  Explanation  Structure 
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Goal  regression,  as  given  in  [MitcheliS6]  and  Figure  3.11.  computes  only  the  most  general  set 
of  antecedents  that  would  support  a  proof  with  the  same  explanation  structure  as  the  training 
example  (i.e.  the  weakest  preconditions  [Dtikstra76.  MintonS4].).  If  oniv  goal  regression  is  per¬ 
formed.  the  proper  description  of  the  goal  concept  supported  by  the  explanation  is  not  always 
determined  since  the  explanation  itself  may  impose  constraints  cn  the  goal  concept.  In  terms  of  the 
Suicide  example,  the  constraint  that  the  killer  be  the  same  as  the  person  killed  is  never  imposed 
and.  as  demonstrated  in  [DeJongSbb],  EBG  constructs  the  following  erroneous  rule: 

DepressedCl’y )  '  BuyOyTc)  Isa(?c.Gun)  —  Kiil(?x.?y) 

This  rule  states  that  everyone  kills  someone  who  is  depressed  and  buys  a  gun.  which  is  clearly  not 
a  conclusion  warranted  by  the  domain  theory.  Since  the  abstract  STRIPS  algorithm  applies  substi¬ 
tutions  generated  by  each  unification  to  the  entire  explanation  structure,  it  computes  the  appropri¬ 
ately  constrained  goal  concept  and  does  not  make  this  mistake. 

As  suggested  in  [DeJong-S6bj.  the  proper  generalized  goal  and  generalized  explanation  can  be 
obtained  by  starting  with  the  generalized  antecedents  obtained  from  regression  and  rederiving  the 
general  proof  Rederiving  the  proof  propagates  constraints  from  the  regressed  expressions  lo  the 
goal,  thereby  appropriately  constraining  the  goal  concept.  The  resulting  generalization  algorithm  is 
then  a  two  step  process:  goal  regression  'back-propagation)  followed  by  proof  reconstruction  (for¬ 
ward  propagation This  approach  was  suggested  based  or.  a  similar  two-pass  generalization  process 
presented  m  .  Mahacevan&f].  A  formal  description  of  a  version  of  EBG  corrected  in  this  manner  is 
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t  [Mooney bob;.'  In  [kedar-CabeiliSTg,].  Kedar-Cabelli  and  McCarty  presented  a  PROLOG 
cf  E3G  u.  hich  also  corrects  the  problem  and  avoids  making  two  separate  passes  through  the 
t ion.  This  -version  of  E3G  is  considered  m  a  subsequent  section. 


3.2.3.  EGGS 


The  EGGS  (Explanation  Generalization  using  a  Global  Substitution)  algorithm  was  developed 
for  generalizing  explanations  of  the  abstract  form  defined  and  used  in  this  paper.  The  algorithm  is 
qu.te  similar  to  the  abstract  STRIPS  algorithm  and  is  shown  in  Figure  3. 12. 3  The  difference  between 
EGGS  and  the  abstract  STRIPS  algorithm  is  that  instead  of  applying  the  substitutions  throughout 
the  explanation  at  each  step,  all  the  substitutions  are  composed  into  one  substitution  y.  After  all 
the  unifications  have  been  performed,  one  sweep  through  the  explanation  applying  the  accumulated 
sucstituticn  y  results  ir.  the  generalized  explanation.  Table  3.1  demonstrates  this  technique  as 
applied  to  the  Cup  example.  It  shows  how  y  changes  as  it  is  composed  with  the  substitutions 
resulting  from  each  unification.  Applying  the  final  substitution  y  to  the  explanation  structure 
shown  ir.  Figure  3.-!  resuits  in  the  generalized  explanation  shown  in  Figure  3.5.  Table  3.2  shows 
how  EGGS  generalizes  the  Suicide  example.  Applying  the  final  substitution  to  the  explanation 
structure  shown  in  Figure  3.10  resuits  in  the  generalized  explanation  shown  in  Figure  3.11.  In  the 
tables,  equalities  are  processed  in  the  order  produced  by  depth-firs:  traversals  of  the  explanation 
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let  y  be  the  null  substitution  {} 

for  each  equality  between  patterns  x  and  y  in  the  explanation  structure  do 
let  9  be  the  MGU  of  xy  and  yy 
let  y  be  y 9 

for  each  wff  x  in  the  explanation  structure  do 
k-s  replace  x  with  xy 

s. 

Figure  3.12:  EGGS  Explanation  Generalizing  Algorithm 


M  JSir.ce.  is  mer.tior.ee  .r.  '.he  previous  footnote.  the  3acic?ropagate  function  presented  :r.  l.MocneySdb;  does  not  properly 

propagate  constraints  across  conjurers,  the  corrected  version  of  E3C  presented  in  [Mooney?6bj  required  mat  rcrsvarcPro- 
pagate  be  performed  before  Bac<?ropagate.  if  the  correct  version  of  bacit-propagation  presented  ir.  Figure  3.S  is  used,  if  is 
/•  r.c  necessary  to  perform,  icr’iard  propagation  erst. 

5H)ue  'o  a  •ypegraphicai  error,  'he  EGGS  algorithm  presented  :n  Moor.ey86cj  did  not  mciude  applying  y  to  x  and  >  pri¬ 
or  "c  comput.r.g  their  MGl.  The  angina,  publication  cl  the  algorithm  in  [DeJongSbb]  did  not  suOr  from  this  mistaite  and 

're  l.-nger  technical  reoort  "ersior.  o:  \Ioor.ev36ci  included  a  corrected  version  of  the  EGGS  algorithm  ' Moore v Wo;. 
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Table  3.1:  EGGS  Apolied  To  the  Cup  Example 


Ecualitv 


Stabiev^xl)  =  Stable'. ’’xi) 


Liftable(?xl )  =  Liftable(?x3) 


Graspable(?x3)  =  Gras  pable(  7x4) 


OpenVesselC’xl)  =  OpenVessel(?x5) 


9  i  y 


{ ?x  1  .'7x2 1  |  (?xl/?x2 


I?xl/°x3}  1  (?xl/?x2.  ?xl/?x3} 


I?xl.'?x5l  1  I?xl/?x2.  ?xl/?x3.  ?xl/?x4.  ?xl 


{?xl '7x4}  j  j?xl/?x2.  ?xl/?x3.  ?xl/?x4| 


7x1. '7x5! 


Table  3.2:  EGGS  Applied  To  the  Suicide  Example 


Pcssess(?xl.7cl)  =  Possess(?x3.1v2) 


Hate(?xl.?vl)  =  Hate(?x2.?x2) 


I  ’cl  7x4) 


Pxl  7x3 .  ',cl/"’v2) 


l?vl  "Lx2.  7v  1/7x1} 


{7cl/?x4| 


Pel, 7x4.  T\1  "7x3.  7cl. 


•  PclPx4.  ?xl  °x3  7cl/ 
'1vl.'7x2.  ?vl/7xl} 


structures:  however,  any  order  will  result  in  equivalent  generalized  explanations  up  to  a  change  of 
variable  names. 

3.2.4.  PROLOG-EBG 

In  [Kedar-CabelliS7a].  Kedar-Cabelli  and  McCarty  present  a  PROLOG  version  of  EBG  which, 
unlike  the  original  EBG.  computes  the  proper  gcai  concept.  PROLOG-EBG  integrates  the  generaliza¬ 
tion  process  with  the  construction  of  explanations  by  PROLOG.  A  generalized  proof  is  constructed 
in  parallel  with  the  proof  for  the  specihc  example.  Any  query  results  in  both  a  specific  and  and  a 
generalized  proof  being  returned. 

The  algorithmic  description  presented  in  Figure  3.13  is  an  attempt  to  specify  the  generaliza¬ 
tion  algorithm  underlying  PROLOG-EBG  as  an  independent  process  (i.e.  separated  from  the  process 
of  theorem  proving).  Like  EGGS.  PROLOG-EBG  constructs  a  global  substitution,  y.  which  is  then 
applied  to  the  complete  explanation  structure.  However,  unlike  EGGS,  y  is  constructed  by  travers¬ 
ing  the  explanation  depth-first  from  the  goal  in  a  manner  analogous  to  trying  to  p*  .ve  the  general 
goal  of  the  explanation  structure  using  backward-chaining.  The  substitution  y  is  constructed  by 
finding  a  substitution  that  allows  the  gcai  to  be  proved  from  the  set  of  operational  wffs  represented 
by  the  leaves  of  the  explanation  structure.  The  generalization  algorithm  is  analogous  to  the  algo¬ 
rithm  for  a  backward-chaining  deductive  system  (like  PROLOG  or  the  deductive  retrieval  system 
tr.  [CharmakSOj).  In  the  algorithm  in  Figure  3.13.  the  function  PROLOG-EBG  returns  two  values: 
the  current  substitution  (y)  and  the  subset  of  the  wffs  in  the  explanation  structure  that  have 
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already  been  traversed  (E).4  When  the  top-level  call  to  PROLOG-EBG  returns.  E  is  the  set  of  all 
wffs  :n  the  explanation  structure  and  y  is  the  hnai  global  substitution. 

It  should  be  noted  that  Hirsh  [HirshS7]  simultaneously  developed  a  version  of  EBG  for  logic 
programming  (using  the  MRS  logic  programming  system  [RussellSS])  which  uses  a  generalization 
algorithm  that  is  equivalent  in  operation  to  PROLOG-EBG's.  In  addition  to  integrating  theorem 
proving  and  generalization.  MRS-EBG  integrates  both  of  these  with  operationalitv  checking 
[M:tcheilS6]  or  pruning  [DeJong86b.  MooneyS6cj.  a  process  discussed  in  section  3.6. 

3.3.  Comparison  of  Explanation  Generalizing  Algorithms 

It  -is  reasonably  clear  that  STRIPS.  EGGS,  and  EBG-PROLOG  al!  compute  the  same  desired 
generalized  explanation.  They  all  perform  a  set  of  unifications  and  substitutions  that  constrain  the 


let  g  be  the  goal  wff  in  the  explanation  structure 
let  (y.  E)  =  PROLOC-EBG(g,  (!) 
for  e  in  E  do  replace  e  -with  ey 

procedure  PROLOG-EBG(x.  9) 

let  S  be  the  set  of  wffs  supporting  x 

for  s  in  S  do  replace  s  with  s9 

let  (y.  E)  =  PROLOG-EBG-Supporters(S.  (1.0) 

return  (y9.  E  U  (xl) 

procedure  PROLOG-E3G-Supoorters(S.  y.  E) 
if  S=0 

then  return  (y.  E) 
else 

let  f  be  the  hrst  element  of  S 
let  R  =  S  -  if! 

if  f  is  equivalent  to  some  wff  e 

then 

let  6  be  the  MGU  of  f  and  e 
let  (5.  P)  =  PROLOG-EBG(e.  6) 
for  r  in  R  do  replace  r  with  :5 
PROLOG-EBG-Supporters(R.  yS.  E  U  P) 
else  PROLOG-EBG-Supporters(R.  y.  E  U  (ft) 

Figure  3.13:  PROLOG-EBG  Explanation  Generalizing  Algorithm 


*7he  r.c'atior,  “  a. 
ill  variables  rsierer.e 


a -=  F1  x."  and  “return  ;  a.  b T  ;s  used  to  denote  the  fact  '.hat  'he  function 
td  o'  a  procedure  are  assumed  to  be  local  lo  tha:  procedure  call. 


•u  o  values:  a  and 
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explanation  structure  into  one  in  which  equivalent  wffs  are  identical.  The  difference  between  them 
lies  m  the  manner  and  order  in  which  the  unifications  and  substitutions  are  done.  As  described  in 
[O'RorkeSTbj.  expianation-based  generalization  can  be  viewed  as  a  process  of  posting  and  propagat¬ 
ing  equality  or  co-reference  constraints  Neither  the  STRIPS  or  EGGS  algorithm  impose  ar.  ordering 
on  the  assimilation  of  the  various  equality  constraints  in  the  explanation  structure.  On  the  other 
hand,  the  various  EBG  algorithms  order  the  assimilation  of  constraints  by  traversing  the  explana¬ 
tion  structure  depth-first.  Although  this  ordering  is  not  required  by  the  generalization  process,  it  is 
a  .natural  consequence  of  integrating  generalization  with  a  backward-chaining  theorem  prover. 

Actually,  the  task  of  producing  a  global  substitution  (y)  for  an  explanation  structure  can  be 
easily  shown  to  reduce  to  the  task  of  finding  a  most  general  unifier  for  two  wffs.  The  two  wffs  for 
the  reduction  are  constructed  by  having  equivalent  wffs  in  the  explanation  structure  occupy 
corresponding  argument  positions  m  the  two  constructed  wffs.  For  example,  below  are  two  wffs 
constructed  for  the  explanation  structure  of  the  CUP  example  (Figure  3.4). 

P( Stable! 7x2).  Lif*able(?x3 ).  Graspable(7x4).  OpenVesselC,x5)) 

PfStabiei  ?xl).  Liftabie(?xl).  Graspable(?x3).  OpenVesselC’xl)) 

A  MGU  for  these  two  wffs  is:  I7xl/',x2.  ',xl/"’x3.  ?xl/?x4.  ?xl/?x5|  which  is  the  same  as  the  global 
substitution  EGGS  constructed  for  this  example  (Table  3.1).  The  two  wffs  constructed  for  the  Sui¬ 
cide  exampie  are: 

P(Isa(?x4. Weapon).  Possess(7x3.?v2).  Hate('!x2.?x2)) 

PC Isa(?c  1  .Weapon).  Possess(7xl,?cl ).  Hate(?xl.'>yl)) 


A  MGU  for  these  two  wffs  is:  {?cl/'?x4.  ?xl/?x3.  ?cl/'?y2.  ?y 1/7x2.  ?y l/?xl}  which  is  again  the 
same  as  the  global  substitution  constructed  by  EGGS  (Table  3.2) 

Consequently,  in  some  sense  the  various  explanation  generalizing  algorithms  are  just  different 
ways  of  implementing  unification.  In  fact.  EGGS  directly  corresponds  to  the  implementation  of 
UNIFf  in  [CharmakSO]  wh  ich  takes  a  pair  of  wffs  and  a  current  substitution  and  returns  an 
updated  substitution  which  includes  variable  bindings  that  unify  the  two  wffs  in  the  context  of  the 
current  substitution.  The  STRIPS  generalizing  algorithm,  on  the  other  hand,  is  more  similar  to  the 
implementation  of  UNIFY  presented  in  [NilssonSO]  in  which  the  substitution  unifying  the  first  ele¬ 
ments  of  two  wffs  is  applied  to  the  rest  of  the  wffs  before  continuing.  A  unification  algorithm  that 
apriiec  each  substitution  to  the  entire  wff  (thereby  generating  the  resulting  unified  wff  as  well  as  a 
unifying  substitution)  would  be  exactly  equivalent  to  the  STRIPS  generalizing  algorithm. 


3.4.  Computational  Complexity  of  Explanation  Generalization 


The  reduction  to  unification  given  above  also  demonstrates  that  the  time  complexity  of  pro¬ 
ducing  a  global  substitution  is  linear  in  the  size  of  the  explanation  since  linear  time  algorithms  exist 
for  unification  [PatersonTS],  Since  unification  can  be  performed  in  linear  time,  obviously  the  size  of 
the  resulting  MGU  must  also  be  linear  in  the  size  of  the  explanation  since  only  a  linear  amount  of 
output  can  be  produced  in  linear  time.  Therefore,  if  !Ei  represents  the  size  of  the  explanation,  we 
can  let  c.E  be  the  time  required  to  construct  the  global  substitution,  and  c,E  be  the  length  of  the 
global  substitution.  Since  the  time  complexity  of  applying  a  substitution  to  a  wff  is  also  linear  in 
the  length  of  its  inputs1,  let  CjCcJEl  -*■  E)  be  the  time  required  to  apply  the  global  substitution  to  the 
explanation.  Therefore,  the  time  required  for  the  complete  process  of  constructing  a  generalized 
explanation  is: 

c3(c:E  -  E)  -  CjE  =  (c,  -  c3(c;  -  nxs 

which  is  clearly  linear  in  size  of  the  explanation. 

Although  this  result  does  not  reveal  the  time  complexity  of  the  individual  algorithms  in  sec¬ 
tion  3.2.  it  is  a  constructive  proof  of  the  existence  of  a  linear-time  explanation  generalizing  algo¬ 
rithm.  Since  linear-time  unification  algorithms  have  apparently  found  limited  use  in  practice  due 
to  large  overhead,  it  is  unlikely'  that  a  generalizing  algorithm  based  on  one  would  be  particularly 
useful  in  practice.  Nevertheless,  it  is  an  interesting  theoretical  result  which  supports  the  important 
ciatm  that  a  generalized  explanation  can  be  computed  very  efficiently. 

In  practice,  the  generalizing  algorithms  given  in  section  3.2  are  quite  efficient  using  a  standard 
non-linear  unifier.  In  chapter  7.  some  empirical  results  cn  EGGS  are  presented  which  include  data 
on  the  time  required  to  learn  from  various  examples.  However,  the  learning  times  presented  there 
include  the  time  required  for  pruning  the  explanation  and  packaging  it  into  a  macro-rule  as  well  as 
the  time  required  for  generalization.  Of  the  problems  reviewed  in  chapter  7.  the  most  complicated 
cne  to  generalize  is  the  "difficult  geometry"  problem  which  has  25  equalities  and  takes  only  3.6 
seconds  of  CPL‘  time  to  generalize. 

3-5.  Correctness  of  Explanation  Generalizing  Algorithms 

Intuitively,  in  order  for  an  explanation  generalizing  algorithm  to  be  "correct.”  its  output 
should  be  logically  entailed  by  the  system's  existing  know-ledge  or  domain  theory'  and  it  should  be 

'The  literature  rn  .inear  unnca :  :or.  does  not  discuss  i.r.ea:  time  substitution  application:  houever,  a  '.ir.ear  tine  algo 
r  :or  this  procedure  is  presentee  :r>.  apper.oix  A. 


as  general  as  possible  given  inis  constraint  and  the  constraint  that  it  retain  the  "structure"  of  the 
original  explanation.  To  dale  there  arc  no  complete  formal  proofs  of  the  correctness  of  anv  ol  the 
above  generalization  algorithms:  and  no  such  proofs  will  be  attempted  in  this  section.  However, 
proving  correctness  does  not  seem  to  be  an  impossible  task  and  this  section  summarizes  several 
approaches  that  could  be  taken. 

If  it  is  assumed  that  explanations  are  logical  proofs  (as  in  [MiichellS6j).  one  would  f.rst  want 
to  prove  soundness .  i.e.  that  the  learned  macro-rule  is  logically  entailed  by  the  existing  domain 
theory.  For  PROLOG-EBG.  it  is  particularly  dear  that  the  generalization  algorithm  is  simply  per¬ 
forming  deduction  on  Horn  clauses  and  thereby  constructing  a  proof  of  the  generalized  goal  concept 
from  the  operational  leaves  of  the  explanation.  Consequently,  proving  soundness  could  make  pro¬ 
ductive  use  of  existing  proofs  of  the  correctness  of  the  deductive  algorithms  underlying  PROLOG 
[LlcydSd.  Robtnsonbo].  However,  proving  that  the  learned  macro-rule  is  the  most  general  one  pos¬ 
sible  wouid  probably  be  more  difficult. 

If  generalization  is  done  by  performing  a  single  unification  as  described  in  section  3.3  (and  the 
unification  algorithm  has  been  proven  correct),  proving  soundness  for  logical  proof  explanations 
becomes  straight-forward.  In  this  approach,  all  of  the  equivalent  wffs  in  the  generalized  explana¬ 
tion  must  be  identical  since  unification  can  be  proven  to  produce  a  substitution  that  will  make  us 
arguments  identical  [Robinson65].  Since  all  of  the  Horn  clauses  in  the  generalized  explanation  are 
instantiations  of  clauses  in  the  domain  theory  (i.e.  they  are  the  result  of  applying  the  global  substi¬ 
tution  to  the  explanation  structure),  the  logically  sound  inference  rule  of  universal  insiaruiacion 
guarantees  that  they  are  entailed  by  the  domain  theory.  Finally,  one  needs  to  show  that  computing 
a  macro-rule  is  simply  performing  logically  sound  deduction  on  the  Horn  clauses  in  the  generalized 
explanation.  If 

k ,  ...  k  ,  A  k  A  k  ...  k  —  c 

and 

l,  A  ...  I  -*  d 

l  ri 

are  two  clauses  in  the  generalized  explanation,  and  d  is  equivalent  to  k  .  then  d  and  -t  must  be 
identical  literals.  Assume  the  second  clause  is  removed  from  the  generalized  explanation  and  the 
first  clause  is  replaced  by: 

k  .  ...  k  ,  A  i  r  ...  I  ;  k  ,  ...  k  —  c 

Since  d  and  *  are  identical  literals,  the  added  clause  is  entailed  by  the  domain  theory  because  it  is 
the  resolvent  of  the  two  clauses  and  the  resolution  rule  is  sound  [Robir.sor.65 ].  Repeating  this 


process  tor  every  set  oi  equivalent  patterns  reduces  the  generalized  explanation  to  the  desired 
macro-rule.  Since  ail  oi  the  clauses  in  the  original  generalized  explanation  are  entailed  by  the 
domain  theory,  and  since  the  clause  added  by  each  step  m  the  reduction  process  is  entailed  by  the 
existing  clauses,  by  induction,  the  completely  reduced  generalized  explanation  (i.e.  the  learned 
macro-rule)  is  entailed  by  the  domain  theory. 

If  it  could  be  proven  that  a  generalization  algorithm  such  as  EGGS  or  EBG  computes  the 
correct  global  substitution  and  is  therefore  equivalent  to  the  single  unification  algorithm,  then  it 
would  follow  that  this  algorithm  was  also  sound.  For  EGGS,  this  would  involve  proving  that  the 
unification  algorithm  in  [CharniakSOj  is  correct. 

Another  approach  to  proving  correctness  of  explanation  generalization,  is  discussed  in 
[0'RorkeS7b]  and  involves  demonstrating  that  a  generalization  algorithm  maintains  the  equality  or 
co-reference  constraints  of  the  explanation  structure  in  the  most  general  wav  possible.  As  described 
in  [0‘RorkeS7b].  explanation  generalization  can  be  performed  by  combining  the  individual  co¬ 
reference  constraints  in  order  to  compute  the  most  general  description  of  each  expression  in  the 
explanation  that  satisfies  all  of  these  constraints.  More  details  on  this  approach  to  verification  and 
how  it  specifically  applies  to  generalizes  based  on  unification  are  giver,  in  [0’RorkeS7bj. 

The  formal  definition  of  a  generalized  explanation  presented  in  section  3.1  captures  ORorkes 
notion  of  correctness  since  it  requires  the  global  substitution  y  to  be  the  most  general  substitution 
that  makes  all  equivalent  wffs  in  the  explanation  structure  identical.  Since  unification  produces  the 
most  general  substitution  that  makes  two  wffs  identical  (as  proved  in  [Robinson65]).  the  single 
unification  algorithm  is  guaranteed  to  produce  the  correct  y  and  therefore  the  correct  generalized 
explanation.  In  this  case,  a  complete  correctness  proof  for  any  of  the  individual  algorithms  in  sec- 
liCR  3.2  would  require  proving  that  the  algorithm  is  equivalent  to  the  single  unification  algorithm. 
As  previously  mentioned,  for  the  EGGS  algorithm,  this  proof  would  involve  proving  the  correct¬ 
ness  of  the  unification  algorithm  in  [CharniakSO], 

3.6.  Pruning  Explanations  for  Operationally 

Often,  the  explanation  structure  for  a  particular  example  is  too  specific  to  support  a  reason¬ 
ably  useful  generalization.  In  these  cases,  the  operationaiity  criterion  [M;tcheil86]  is  met  by  nodes 
higher  in  the  explanation  tree  than  the  leaves  and  it  is  adoisabie  to  prune  units  from  the  explana¬ 
tion  structure  that  are  more  specific  than  required  for  operationaiity.  If  this  pruning  is  acne  prior 
to  generalization  as  shown  in  Figure  3.1.  it  will  result  in  a  more  abstract  generalized  explanation 
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■vr.ich  :s  applicable  to  a  broader  range  of  examples  For  example,  il  the  rale  for  inferring  Graspable 
:>  removed  from  the  explanation  structure  snown  in  Figure  3.3.  the  following  more  general  (but 
less  operational)  definition  of  Cup  is  acquired- 

BottomC’yl)  f'  PartOf(?vl.?xl)  A  Flat(°yl )  \  GraspableC’xl )  A  Light(7xl ) 

'  Concavity(?y3)  A  PartOf(7v3.7xl )  A  L'pwardPcintmg(?y3)  —  Cup(?xl) 

More  appropriate  examples  of  pruning  in  various  domains  will  be  giver,  in  the  following  two 
chapters  and  in  chapter  11 

Determining  the  appropriate  operationally  criterion  has  been  the  subject  of  much  discussion 
in  the  EBL  literature  [DeJongS6b.  Keller87b.  Mitchell86.  Segre87b.  Shavhk87b].  There  is  clearly  a 
trade-off  that  must  be  resolved  between  operationality  and  generality.  A  more  general  explanation 
is  useful  in  a  larger  set  of  future  situations:  however,  it  is  also  harder  to  apply  in  those  situations. 
A  more  specific  explanation,  on  the  other  hand,  is  easier  to  a  apply  to  future  situations:  however,  it 
is  less  applicable.  In  the  long  run.  it  is  probably  best  to  retain  explanations  at  several  levels  of  gen¬ 
erality  as  suggested  in  [Mooney85b]  and  as  done  in  the  PHYSICS101  system  [Shavlik87b].  This 
allows  a  more  specific  explanation  to  be  used  when  it  is  applicable  while  still  permitting  a  more 
genera;  explanation  to  be  used  when  a  more  operational  one  is  not  available. 

A  recent  suggestion  for  determining  operationality  is  the  one  used  in  ARMS,  an  EBL  system 
for  robotics  [Segre87bj.  It  involves  pruning  all  of  the  explanation  below  shared  substructure.  In 
terms  of  the  representations  used  here,  this  approach  would  prune  all  nodes  below  the  point  where 
a  subgraph  of  the  explanation  becomes  a  tree  as  opposed  a  general  directed  acyclical  graph.  In  other 
words,  it  keeps  pruning  leaves  of  the  explanation  until  a  node  is  found  that  supports  more  than  one 
other  node.  Although  this  pruning  algorithm  may  work  well  for  the  ARMS  domain,  it  is  not  a  gen¬ 
eral  solution  to  the  problem  of  determining  operationality.  Many  explanations  that  support  useful 
generalizations  do  not  have  any  shared  substructure.  In  fact,  most  of  the  examples  of  explanations 
presented  ir.  the  following  three  chapters  are  trees  and  consequently  do  not  have  shared  substruc¬ 
ture.  The  ARMS  approach  to  pruning  would  remove  the  entire  explanation  in  such  cases,  and  con¬ 
sequently  miss  the  opportunity  to  learn  useful  new  rules  and  operators. 

Therefore,  at  this  pome  in  time,  determining  which  predicates  or  operators  are  operational  is 
generally  a  domain  dependent  decision.  Consequently,  the  current  EGGS  system  simply  has  a  hook 
that  allows  an  arbitrary  pruning  function  to  be  called  before  an  explanation  is  generalized.  Some 
examples  of  pruning  functions  are  given  in  the  following  two  chapters  and  in  chapter  11.  In 
Hirsh  s  MRS-E8G  system  [Hirsh57j.  meta-ievel  iogical  decuction  is  used  to  determine 
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operationally  This  approach  has  ;he  advantage  ol  allowing  operationalay  proofs  themselves  to  be 
generalized  m  an  explanation-based  manner  ;n  order  to  determine  the  most  general  operational 
explanation. 
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3.7.  Integrating  Explanation  Construction,  Pruning,  and  Generalization 

Instead  of  performing  the  first  three  steps  in  Figure  3.1  sequentially,  these  steps  can  often  be 
integrated  and  performed  in  an  interleaved  fashion.  As  discussed  in  [DeJongS6b.  MooneyS6c],  the 
EGGS  generalization  algorithm  is  easily  integrated  with  the  explanation  building  process  by  updat¬ 
ing  the  global  substitution  each  time  a  new  rule  is  added  to  the  evolving  explanation.  As  men¬ 
tioned  earlier.  PROLOG-EBG  elegantly  integrates  generalization  with  the  theorem  proving  process, 
ar.d  MRS-EBG  elegantly  integrates  both  of  these  processes  with  pruning  the  explanation  for  opera- 
tionalitv  . 

Although  integrating  these  processes  is  aesthetically  appealing,  there  is  a  price  associated  with 
it.  For  example,  the  integration  of  theorem  proving  and  generalization  in  PROLOG-EBG  and  MRS- 
EBG  involves  unnecessarily  generalizing  dead-end  branches  of  the  search  tree  which  are  eventually 
abandoned  and  never  become  part  of  the  final  proof.  If  generalization  were  postponed  until  the 
final  proof  is  available,  this  useless  computation  could  be  avoided.  However,  as  noted  in  [HirshS7], 
integrating  generalization  and  theorem  proving  is  useful  when  there  are  multiple  possible  explana¬ 
tions  for  the  specific  example  only  some  of  which  are  operational.  In  this  case,  integrated  generali¬ 
zation.  theorem  proving,  and  operationality  checking  allows  theorem  proving  to  continue  until  an 
operational  proof  is  eventually  found. 

Another  problem  with  integration  is  that  in  many  cases  operationality  cannot  be  determined 
until  the  complete  explanation  is  available.  When  learning  by  observing  the  problem  solving 
behavior  of  an  external  agent,  the  eventual  goal  to  be  achieved  is  generally  unknown  until  all  of 
the  agent  s  actions  have  been  observed.  However,  the  pruning  algorithm  often  requires  knowledge 
of  the  goal.  For  example,  the  pruning  algorithm  used  in  GEXESIS  (see  chapter  11)  needs  to  know 
what  actions  and  properties  eventually  support  the  achievement  of  the  goal.  Consequently,  in  these 
situations,  pruning  must  be  postponed  until  the  complete  explanation  has  been  constructed.  If  gen¬ 
eralization  is  performed  before  pruning,  the  resulting  generalization  may  be  too  specific  since  it  may 
incorporate  constraints  introduced  by  the  pruned  parts  of  the  explanation.  Therefore,  when  prun¬ 
ing  must  be  performed  after  the  explanation  is  complete,  generalization  and  explanation  cannot  be 
easily  integrated.  As  discussed  in  [DeJor.gS6bJ.  if  generalization  and  explanation  are  integrated. 
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additional  constraints  introduced  by  pruned  sections  of  me  explanation  car.  later  oe  retracted:  how- 
e  -  e..  retracting  constraints  is  ve-y  difficult  and  requires  the  capabilities  of  a  truth  maintenance  sys¬ 
tem  (TMS';  [Dcyle79].  The  MA  system  [O'RorkeSTbj  is  an  example  of  an  EBL  system  that  uses  a 
TMS  [McAliesterS2]  to  retract  co-reference  constraints:  however.  this  system  is  very  slow  com¬ 
pared  to  simpler  systems  based  on  unification.  Because  of  ail  these  problems,  explanation  con¬ 
struction.  pruning,  and  generalization  are  performed  sequentially  in  the  EGGS  system  as  shown  in 
Figure  3.1. 

3.S.  Explanation  Generalization  Versus  Chunking  and  Production  Composition 

Explanation-based  learning  of  macro-rules  and  macro-operators  is  closely  related  to  produc¬ 
tion  system  learning  mechanisms  that  compose  production  rules.  The  chunking  process  in  SOAR 
[LairdS-l.  LairdS6a.  LairdS6b]  and  the  knowledge  compilation  process  of  composition  in  ACT* 
[ A.ndersonS3b.  Anderson86j  are  two  similar  production  system  learning  models.  Both  processes 
build  macro-productions  based  on  traces  of  productions  produced  bv  the  problem  solver  when 
solving  a  particular  problem. 

Besides  the  fact  that  these  systems,  unlike  STRIPS.  E3G.  and  EGGS,  do  not  rely  on  a  logic- 
oased  representation,  the  primary  difference  lies  in  the  less  analytical  generalization  process. 
SOAR's  generalization  algorithm  is  described  in  detail  in  [Rcsenbioom86]  where  it  is  compared  and 
contrasted  to  E3G.  The  generalization  process  is  basically  one  of  changing  constants  to  independent 
■. ariafcles.  However,  due  to  the  difference  in  representation  language,  the  problem  of  over- 
generalization  mentioned  in  section  3.2.1  is  avoided.  Constants  in  SOAR  come  in  two  types. 
: Jen: tiers  which  are  symbols  for  particular  objects,  and  more  meaningful  constants  such  as  "5" 
and  "blue."  For  example,  representing  the  logical  assertion  Color(3.blue)  requires  creating  an  extra 
identifier  for  the  constant  "blue"  and  using  the  two  assertions  Color (13.0  and  Xame(£.  blue).  The 
generalization  process  ;n  SOAR  changes  only  identifiers  tc  variaoles.  and  since  production  rules  can¬ 
not  :heck  for  particular  identifiers,  the  over-generalization  problem  is  avoided.  The  cost  incurred 
:  or  avoiding  the  problem  in  this  manner  is  an  extra  distinction  in  the  representation  language. 

However,  the  problem  of  under-generalization  mentioned  in  section  3.2.1  remains.  For  exam¬ 
ple.  ;r.  the  SOAR  formulation  of  the  Safe-To-Stack  problem6  presented  in  [RosenbloomS6],  the  rule 
'.earned  is: 

‘Vr  sec  nor.  J  2.1  :or  i  corr.pieie  spec  idea  nor.  of  tins  example,  ■vnch  :s  cnrir.a ;:  y  f  rcrn  1  Milc.netlSb]. 


rr.e;  x.v 


Der.sitv  x.a )  *  Name' v.endtable)  '  Proouc;(v.d.d)  •*  Less(d.w)  '•  \ameiw.5 


—  Safe-To-Stacklx.v) 


As  noted  m  ’RosenbloomSb],  this  rule  is  an  under-generalization  since  it  requires  the  density  and 
the  weight  of  the  cf  object  being  stacked  to  be  the  same  (i.e.  d).  Since  the  box  in  the  example  just 
happened  to  have  the  same  weight  and  density,  the  simple  variabhzation  process  requires  them  to 


be  the  same.  Fcr  the  same  reason,  the  initial  and  final  rooms  would  unnecessarily  be  required  to  be 


same  if  this  technique  were  applied  to  the  STRIPS  example.  Retaining  such  spurious  features  of  the 
example  in  the  generalization  is  a  basic  violation  of  the  explanation-based  approach.  EBL  stipulates 
that  only  these  constraints  required  to  maintain  the  validity  of  the  solution  should  be  incorporated 


in  the  generalization. 


In  response  to  the  problem  of  under-generalization.  Rosenbioom  and  Laird  [RosenbloomS6] 
state  that:  "If  an  example  were  run  in  which  the  density  and  the  weight  were  different,  then  a  rule 
would  be  learned  to  deal  with  future  situations  in  which  they  were  different. (p.  564)"  However,  if 
the  mere  general  rule  was  learned  from  the  original  example,  this  new  example  could  be  solved 
quicker  using  the  learned  rule.  Also,  unless  a  check  is  made  to  remove  subsumed  rules  (i.e.  rules 
that  are  specializations  of  existing  rules),  this  solution  leaves  useless  rules  lying  around  which 
decrease  the  performance  by  increasing  the  number  of  rules  the  system  must  check  for  application. 
Rcsenfcicom  and  Laird  also  state  that:  "The  Soar  approach  to  goal  regression  is  simpler,  and  focuses 
on  the  information  in  working  memory  rather  than  the  possibly  complex  patterns  specified  by  the 
rules. ...(p.  564  i"  if  the  problems  with  a  simple  constant  to  variable  generalization  algorithm  were 
offset  by  a  marked  increased  in  efficiency  of  generalization,  then  perhaps  a  case  could  be  made  for 
the  simpler  generalization  process.  However,  as  shown  in  section  3.4.  the  computational  complex¬ 
ity  of  a  unification-based  generalization  algorithm  is  linear  in  the  size  of  the  explanation.  Since 
simply  tracing  through  the  complete  explanation  replacing  each  constant  with  a  variable  is  aiso  a 


linear  process,  the  gain  in  efficiencv  is  at  most  a  constant  factor. 


Of  course.  SOAR  could  probably  be  modified  to  include  a  generalizer  that  prevents  under- 
generalization.  This  would  require  retaining  copies  of  the  general  parameterized  productions  (with 
unique  variables)  in  the  production  traces  produced  during  problem  scivi.ng.  Generalization  would 
then  require  constructing  the  most  general  set  of  variable  bindings  that  allows  ail  of  the  left- 
hand-sides  of  general  productions  to  match  the  right-hand-sides  of  the  general  productions  that 
they  support  :r.  tne  production  trace.  A  procedure  analogous  to  unification  for  matching  produc¬ 


tion  conditions  could  be  used  to  produce  the  required  giobal  substitution. 


Fmallv.  regarding  composition  in  ACT’,  discussions  o:  '.he  underlying  generalization  process 
oduciicn  composition.  [AndersonS2.  Andersonfs3a.  AndersonSSb.  AndersonJs6j  tail  to  give 
it  details  of  the  generalization  algorithm.  However,  the  fact  that  no  mention  is  made  of  the 
sties  of  generalization,  the  limits  of  simply  changing  constants  to  variables,  or  the  use  of  a 
ilizer  that  analyzes  variable  bindings  indicates  that  a  mere  analytic  generalize.1*  is  not  used, 
the  examples  giver,  of  composition  involving  telephone  dialing  [  AnderscnS2].  geometry  prob- 
ioiving  [AndersonSoa.  AntiersonSSb],  and  Lisp  programming  [ AndersonS3b.  AndersonS6. 
rsonST’o]  can  ail  be  accomphsned  by  simply  changing  constants  to  variables.  An  example  of 
mg  EGGS  to  a  geometry  example  from  [AndersonS’b]  is  giver,  in  section  4.2.5  However, 
e  many  of  the  other  examples  EGGS  has  been  tested  on.  the  proper  generalization  for  this 
pie  could  be  obtained  by  simply  changing  constants  to  v  ariables 


.  -  .  V  v.  V.  -- 
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CHAPTER  4 


EGGS:  LOGICAL  PROOF  EXPLANATIONS 


In  most  domains,  explanations  can  be  represented  as  logical  proofs  that  an  example  meets  the 
definition  of  a  concept  or  that  a  combination  of  operators  achieves  a  goal.  Most  attempts  to  formal¬ 
ize  explanation-based  learning  have  represented  explanations  as  logical  proofs  [HirshST.  Kedar- 
CaoelliS'a.  MitcheiI56.  PrieditisSTl.  The  Cup  example  presented  in  the  precious  chapter  is  an 
example  of  no\c  proofs  are  represented  as  explanations  in  EGGS.  This  section  discusses  the  EGGS 
system  s  ability  to  construct,  understand,  generalize,  and  learn  macro-rules  from  logical  proofs.  It 
also  presents  numerous  examples  of  applying  EGGS  to  various  domains. 


4.1.  Facilities  in  EGGS  Supporting  Logical  Proof  Explanations 

The  complete  EGGS  system  is  equipped  with  general  purpose  sub-systems  for  proving 
theorems,  understanding  incomplete  proofs,  and  generating  macro-rules.  Figure  4.1  illustrates  how 
these  components  combine  with  the  generalizer  to  comprise  a  complete  learning  system.  The 
theorem  prover  and  proof  verifier  allow  EGGS  to  construct  explanations  by  either  proving  asser¬ 
tions  itself  or  supplying  reasons  and  filling  in  gaps  in  sketchy  proofs  provided  by  the  user.  These 
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Figure  4.1:  EGGS  Architecture  for  Logical  Proof  Explanations 


explanations  can  men  be  generalized  using  the  EGGS  generalization  algorithm,  resulting  in  maim  - 
ru.es  that  can  be  used  to  construct  and  understand  similar  proofs  mere  efficiently. 

4.1.1.  Theorem  Proving 

Tr.e  EGGS'  theorem  prover.  like  the  theorem  prover  underlying  PROLOG  [ClocksinS4],  is  a 
back*  ard-chaimr.g.  depth-first  prover  for  Horn  clauses.  Specifically,  it  is  a  version  of  the  deductive 
-ctri ever  gn.en  m  ’CharmakSO]  modified  to  return  proof  trees  (explanat.onsj  for  each  answer  it 
retrieves.  instead  cf  automatically  returning  all  possible  answers  to  a  query,  it  uses  generators 
’CharmakSO.  TeitelmanS3]  to  produce  one  answer  at  a  time.  Horn  clause  inference  ruies  for  the 
domain,  are  indexed  under  the  predicate  name  of  their  consequent  As  in  PROLOG,  ruies  for  con¬ 
cluding  a  particular  predicate  are  ordered  and  are  tried  in  order  when  searching  for  a  proof.  The 
EGGS'  retriever  can  also  be  given  a  depth  bound,  d.  to  prevent  it  from  chaining  more  than  d  rules 
deep  when  attempting  to  retrieve  a  particular  fact. 

As  ar.  example  of  the  performance  of  the  EGGS  theorem  prover.  consider  the  Cup  example 
Giver,  the  facts  and  rules  given  in  section  3.1  and  the  goal  Cup(Ob’l)  (or  just  Cup(?x))  the  EGGS 
retriever  produces  the  proof  shown  in  Figure  3.3. 

Vv'her.  a  goal  is  successfully  deduced,  it  is  added  to  the  database  of  facts  along  with  the  facts 
composing  its  proof.  Data  dependencies  [Charr.iakSO.  Dovie79]  are  maintained  between  ail  the 
facts  m  the  database.  These  dependencies  are  used  to  provide  explanations  for  facts  in  the  database 
and  ccuid  also  be  used  to  update  the  a  database  following  the  retraction  of  ar.  assertion  The  latter 
task  is  one  of  the  primary  functions  of  a  truth  maintenance  system,  [de  RieerS6.  Doyie79.  McAlies- 
ter$2j  and  is  not  provided  in  the  current  version  of  EGGS. 

4.1.2.  Proof  Verification 

The  deductive  retriever  can  also  be  used  to  supply  reasons  and  nil  in  gaps  in  sketchy  proofs 
given  to  the  system.  Sketchy  proofs  are  ordered  sequences  of  subgoals  or  lemmas  to  be  proven 
nefore  attempting  to  prove  the  ultimate  goal.  For  example,  instead  of  trying  to  prove  Cup(Obil  ' 
directly,  the  system  can  be  given  the  following  incomplete  proof: 

Graspablet  Obi  I ) 

Liftacle'  Ob]  1 ) 

Open  Vessel'  Obi  1  5 

Q£D  Cur  Ob;  1? 


The  system  attempts  to  prove  that  each  step  in  the  proof  follows  deductively  from  the  initial  facts 
together  with  the  facts  deduced  in  previous  steps.  When  an  intermediate  step  is  proven,  it  is  added 
to  the  database  together  with  the  facts  and  dependencies  m  its  proof.  If  it  is  known  that  the 
sketchy  proof  w'iii  not  be  missing  more  than  d  inferences  between  steps,  then  the  depth  bound  on 
the  prover  can  be  set  tc  d  during  verification.  The  end  result  of  processing  a  sketchy  proof  is  a 
complete  proof  tree  for  the  goal.  For  the  Cup  example,  the  proof  constructed  by  the  verifier  is  the 
same  as  the  one  generated  by  the  theorem  prover  (see  Figure  3.3).  More  realistic  examples  of 
understanding  incomplete  mathematical  proofs  are  giver,  in  sections  4.2.3  and  4.2.5. 

Since  in  the  simple  Cup  example  there  is  only  one  rule  for  concluding  each  predicate,  expli¬ 
citly  supplying  subgoals  does  not  reduce  the  search  space  and  is  no  more  efficient  than  proving  the 
goal  directiy.  Frequently,  however.  using  a  sketchy  proof  can  greatly  reduce  search  over 
undirected  theorem  proving.  For  example,  let  <lr  f,,  ...  I  >  be  a  sketchy  proof  of  ln  from  the  set 
of  initial  facts  F.  Assume  that  l,  can  be  proven  from  F  with  a  proof  cf  depth  n  and  that  each  l. 
(2 <i^n)  can  be  proven  from  F  U  with  a  proof  of  depth  m  (specifically,  the  path  from  l, : 

to  i.  in  the  proof  is  of  length  m).  Figure  4.2  shows  an  abstract  specification  of  the  complete  proof. 
Let  d-nm  be  the  depth  of  the  complete  proof.  The  verifier  can  be  used  to  complete  this  sketchy 
proof  bv  setting  the  depth  bound  to  m.  The  number  of  nodes  searched  to  prove  each  lemma  will 
then  be:  0<  o'"),  where  b  is  the  average  branching  factor.  Since  there  are  n-dim  lemmas,  the  total 
search  needed  to  complete  the  proof  is: 

0(—  b") 

m 

As  m  gets  large.  b~  becomes  the  dominant  term.  This  is  because  by  L'Hopital's  rule: 

..  dbm  dbmln{b) 

iim  -  =  - 

171  —QO  nx  1 

Consequently,  the  search  time  needed  to  verify  the  sketchy  proof  is  Oidb^lnib  I).  Therefore,  for 
sufficiently  large  problems,  a  decrease  in  the  number  of  inferences  between  steps  in  such  a  sketchy 
proof  can  cause  an  exponential  decrease  in  the  time  needed  to  prove  the  theorem.  Since  unaided 
theorem  proving  is  simply  a  degenerate  case  of  a  sketchy  proof  in  which  m=d.  using  a  sketchy 
proof  can  be  bJ  ’  times  faster  than  proving  the  theorem  directly. 


4.1.3.  Learning  Macro-Rules 

Whether  a  proof  is  generated  by  direct  theorem  proving  or  by  explaining  a  sketchy  proof,  it 
can  be  generalized  and  used  to  learn  a  new  macro-rule.  A  macro-rule  is  obtained  from  the  general¬ 
ized  explanation  by  taking  the  goal  (root)  as  the  consequent  and  the  leaves  as  the  antecedents. 
Leaves  that  are  Horn  clauses  that  do  not  have  antecedents  but  are  from  the  domain  theory  (e.g 
Equal(?x.?x))  represent  general  facts  of  the  domain,  and  therefore  are  not  included  in  the  list  of 
antecedents  for  the  macro-rule  since  they  are  always  true  in  the  given  domain.  The  macro-rule 
resulting  from  this  process  is  deductively  implied  by  the  existing  domain  theory  since  the  macro¬ 
rule  is  obtained  by  chaining  together  existing  domain  rules  in  a  particular  manner.  The  macro-rule 
learned  from  the  Cup  example  was  presented  in  section  3.1. 

After  a  macro-rule  is  generated,  it  is  added  to  the  beginning  of  the  current  list  of  rules  for 
inferring  instances  of  the  same  predicate.  Since  the  backward  chaining  algorithm  tries  the  rules  m 
order,  this  insures  that  the  learned  rule  will  be  tried  first  in  the  future,  resulting  in  performance 
improvement  :f  the  problem  can  be  soived  using  the  learned  rule.  If  the  new  ruie  cannot  be  used  to 
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soive  ;he  problem,  then  it  eventually  resorts  to  using  its  previous  knowledge  to  solve  the  problem 
In  this  case,  performance  may  be  degraded  since  time  was  wasted  attempting  to  apply  the  learned 
rule.  Some  empirical  results  as  well  as  further  discussion  of  the  effect  of  rule  learning  on  tneorem 
proving  performance  is  given  in  chapter  7. 

If  the  EGGS  system  is  used  solely  as  an  independent  theorem  prover  which  learns  macro¬ 
rules.  it  closely  resembles  a  learning  PROLOG  interpreter,  like  that  described  in  [PrieditisS7],  On 
the  other  hand,  if  it  used  to  analyze  sample  proofs  and  learn  macro-rules  from  them,  it  behaves 
more  like  a  learning  apprentice  system  [MitchellS5.  0'Rorke54J.  The  important  thing  to  note,  how¬ 
ever.  is  that  the  same  generalization  algorithm  underlies  both  approaches  to  learning. 

4.2.  Logical  Proof  Examples  in  Several  Domains 

Many  of  the  examples  of  explanation-based  learning  in  the  literature  can  be  formalized  as 
learning  macro-rules  for  logical  deduction.  A  deductive  formulation  of  Winston's  Cup  example 
and  the  Suicide  example  were  given  in  chapter  3.  This  section  presents  additional  examples  from 
six  different  domains.  It  should  be  noted  that  since  equated  wffs  are  always  identical  in  specific 
and  generalized  explanations,  equivalent  wffs  are  collapsed  in  future  graphs  of  explanations. 

4.2.1.  Safe-To-Stack  Example 

The  Safe-To-Stack  example  is  a  simple  example  introduced  in  [Mitchell86]  to  illustrate  the 
EBG  explanation-based  generalization  method.  It  involves  learning  an  operational  rule  for  when  it 
is  safe  to  stack  something  on  an  endtable.  The  example  involves  the  following  facts: 

Isa(Objl.Box).  Isa(Obj2.Endtable)  Color(Objl.Red). 

CoIor(Obj2.Blue).  Volume(Objl.l).  DensitvCObjl .0. 1 ) 

The  following  domain  rules  can  be  used  to  prove  that  Objl  can  be  safely  stacked  on  Obj2: 

LighierC’x.'V)  —  SafeToStack(7x.?v) 

-FragileC’v)  —  SafeToStack(?x.?y) 

Voiume(?x.?v)  A  Density(.?x.?d)  —  W>ight(?x.?v*?d) 

W'eight(?x.",w)  A  Weight(?y.?u)  A  Less(?w.?u)  —  Lighter(?x.!’y) 

Isa(?x. Endtable)  —  W'eightC’x.fi) 

The  specific  proof  that  the  EGGS'  deductive  retriever  generated  for  this  problem  is  shown  in  Figure 
4.3  and  the  generalized  proof  is  shown  in  Figure  4.4.  The  general  rule  learned  from  this  example  is: 
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SafeToStack(Objl,Obj2) 

A 

I 

Lighter(0bjl,0bj2) 


WeighttObjl.lM) 


\A'eight(Obj2,S) 


Less(lM.S) 


Volume(Objl,l.' 


Density(Objl,.l) 


isa(.Ohj2,EndTable) 


Figure  4.3:  Safe-To-Stack  Example  —  Specific  Explanation 


SafeToStack(?xl,°yl) 

t 

LighteH?xl,?yl) 


Weight(?x,l.?vi*^dl) 


Volume(?xl,?vi) 


Density<?xlt?dl) 


Weight(?yl.5) 

A 

I 

Isa(?yl,EndTable) 


Less(?vl*?dl,5) 


Figure  4.4:  Safe-To-Stack  Example  —  Generalized  Explanation 


VoIume(?xl.?vl)  A  Densitv(?xl.?dl)  A  Isa(?yl  .Endtable)  A  Less(vl*dl.5)  — 
SafeToSiack(?xl.?vl ) 

This  rule  can  then  be  used  to  solve  similar  problems,  such  as  showing  that  a  book  can  be  safely 
stacked  on  an  er.dtable  given  the  facts  below: 

IsafObjl .Book).  Isa(Obj2. Endtable).  Color! Objl. Green). 

Color(Obi2. Brown).  Volume(Objl..5).  Density(Objl  .2) 
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4.2.2.  LEAP  example 

The  LEAP  system  [MitchellSd]  :s  a  learning  apprentice  in  \  LSI  design  which  observes  the 
behavior  of  a  circuit  designer.  It  attempts  to  learn  in  an  explanation-based  fashion  by  observing 
and  analyzing  specific  examples  of  logic  design.  As  an  example  of  learning  in  this  domain,  consider 
the  following  example  taken  from  [Mitchell85],  Given  the  task  of  implementing  a  circuit  that  com¬ 
putes  the  logical  function:  (a  V  b)  A  (c  V  d).  a  circuit  designer  creates  a  circuit  consisting  of  three 
NOR  gates  like  that  shown  in  Figure  4.5.  The  system  attempts  to  verify  that  the  given  circuit 
actually  computes  the  desired  function.  The  explanation  constructed  by  EGGS’  deductive  retriever 
proving  that  the  circuit  computes  a  function  that  is  equivalent  to  the  desired  function  is  shown  tn 
Figure  4.6.  In  this  example,  the  domain  knowledge  available  to  the  system  includes: 


Remove  Double  Negation: 
DeMorgan's  Lew: 

Substitution  of  Equals: 
Equivalence  Axiom: 

The  generalized  form  of  this  proof 
pie  is: 


Equiv(?x.?v)  —  Equiv(-«(->(?x)).?y) 

Equiv((->?xA-?y).?a)  —  Equiv( ->(',xV'>y).°a ) 

Equiv(?x.?a)  A  Equiv(?v.?b)  —  Equiv(?xA?y.?aA?b) 

Equiv(?x.?x) 

is  shown  in  Figure  4.7.  The  general  fact  learned  from  this  exam- 


Equiv(-(-?a,/-'?b).  ?aA?b) 


Had  generalization  been  performed  by  simply  changing  constants  to  variables,  the  result  would 
have  oee.n  the  overly  specific  rule  shown  below: 

Equivf -(-(?xV?y)V-*(?zV?w)).  (?xV?y)A( ?zV?w)) 
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Figure  4.5:  Circuit  Design  Learning  Problem 
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Equiv( -( ->(aVb)V-(cV  d)  l.(  a  ^b)A(c'/d)) 

t 

Equtv(-(-'(aVb))A-'(-(cVd)).(aVb)A(c'/d)) 


Equiv(  ->(->(  aVb)).aVb) 
A 

EquivCa’x  b.avb) 


Equi  v(  -<( -( cVd ))  .cVd ) 

A 

Equiv(cvd.c  /d) 


Figure  4.6:  LEAP  Example  —  Specific  Explanation 


Equiv(  ->(->(  ?al)''/'-,r1bl  )).?alA?bl ) 

f 

Ea  u  i  v(  ->(->(  ?a  1 )  )A-(  -«( 1 ))  .'’a  1  A?b  1 ) 


Equiv(-«(-(?al)).?al) 

A 

Eauiv(?al.?al) 


Equiv(-*(-(?bl)).?bl) 

A 

Equiv(?bl.?bl) 


Figure  4.7:  LEAP  Example  —  Generalized  Explanation 


As  a  result  of  the  explanation-based  approach,  the  resulting  generalization  is  not  sensitive  to  the 
fact  that  the  first  stage  of  the  circuit  involved  two  NOR  gates. 

For  example,  the  generalization  would  support  using  two  N AND  gates  and  a  NOR  gate  to 
AND  four  inputs  together  as  shown  in  Figure  4.S.  The  rule  learned  from  the  previous  example  is 
capable  of  aiding  in  both  the  design  and  verification  of  this  circuit.  Given  the  query:  EquivC’x. 
(a'b)  ■  (cAd)))  the  deductive  retriever  "designs'-  the  circuit  shown  in  Figure  4.S.  and  given  the 
query  Equiv(-(-(aAb)  v  -(cAd)).  (a'<b)  A  (cAd)))  it  verifies  it.  The  learned  rule  allows  both  of 
these  tasks  to  be  done  in  one  step.  This  example  mcelv  illustrates  how  a  learned  rule  can  increase 
the  efficiency  of  both  the  construction  and  understanding  of  solutions  to  similar  problem. 


4.2.3.  NLA  Example 


Another  explanation-based  learning  system  in  the  domain  of  logic  is  MA  [0‘RorkeS4. 
O'RorkeSTb]  which  learns  proof  schemata  from  sample  natural  deduction  proofs.  When  the  system 
cannot  complete  a  proof  for  a  particular  theorem,  a  teacher  steps  in  and  completes  the  proof.  MA 
then  generalizes  the  teacher's  proof  in  an  explanation-based  manner  to  generate  a  proof  schema  that 
can  be  used  to  solve  future  problems.  Consider  a  variant  of  the  example  discussed  in  [0'Rorke84] 
of  proving  a  particular  case  of  the  law  of  excluded  middle:  .\IL=> (PAQ)V-(PAQ)  (i.e. 
(P'QVv-iPAQj  can  be  deduced  from  the  empty  set  of  assumptions).  The  natural  deduction  proof 
the  deductive  retriever  generates  for  this  example  is  shown  in  Figure  4.9.  The  following  rules  of 
natural  deduction  [Manna74]  are  employed  in  this  proof: 


Assumption  Axiom: 


(7s  •  ?v)=>^x 


Or  Introduction : 


'1x=>?v  —  ,x=0?vV?z 


Or  Introduction: 


?x=c>?v  —  ?x^>?zV?v 


Elimination  Of  Assumption:  ('’x  .  ?v)=0?z  A  (-7x  .  ',y)=^?z  — •  ?y=0',z 


The  expression  '>x=>?y  means  that  the  wff  0y  is  deducible  from  the  list  of  assumptions  (wffs)  °x. 
LISP  dot  notation  is  used  to  represent  lists  of  assumptions.  The  generalized  proof  EGGS  generates 
for  this  example  is  shown  in  Figure  4.10.  From  a  specific  instance  of  proving  (PAQ)V. -<(PAQ)  from 
no  assumptions,  a  general  proof  is  learned  for  proving  the  disjunction  of  any  wff  and  its  negation 
from  any  set  of  assumptions.  .Notice  again  that  simply  changing  the  constants  P  and  Q  to  variables 
would  have  resulted  in  an  under-generalization.  The  mere  general  fact  learned  by  EGGS  allows  it 


to  solve  one  test  problem:  NIL=>PV-P  m  one  step. 


s 


7yl~  =>  ?yl6V-?yl6 


(°yl6  .  ';yl7)  =o  '’yl6V-7yl6 
A 

I 

Pyl6.?yl7)  =>  '’y  16 


t-’yl b  .  7yl7)  =>  ->yl6V-?yl6 

A 

(-7yl6 . 7yl")  =o  -?y!6 


Figure  4.1Ch  MA  Example  —  Generalized  Explanation 


The  same  proof  and  generalization  shown  in  Figures  4.9  and  4.10  result  from  understanding 
the  following  sketchy  proof: 

((PAQ))  =0  (PA Q)  V  -(PA Q) 

(-(PAQ))  =0  (PAQ)V  -(PAQ) 

Q£.D:.  NIL  =>  (PAQ)  y  -.(PAQ) 

If  the  depth  bound  on  the  deductive  retriever  is  set  to  two.  explaining  this  proof  takes  only  4.5 
CPU  seconds,  compared  to  proving  the  theorem  directly  w'hich  takes  9  CPU  seconds  with  a  depth 
bound  of  three.  18.5  CPU  seconds  with  a  depth  bound  of  five,  and  759  CPU  seconds  with  a  depth 
bound  of  ten!  This  simply  demonstrates  that,  as  expected,  understanding  a  proof  can  be  much 
more  efficient  than  independently  discovering  one. 


S' S' 


4.2.4.  LEX2  Example 


I 


I 


LEX2  [Mil  cheiiS3]  is  an  explanation-based  learning  system  in  the  domain  ol  integration  prob¬ 
lem  solving.  Unlike  the  other  systems  addressed  in  this  paper  which  learn  new  inference  rules  or 
plans.  LEX  learns  search  control  knowledge.  Specifically,  it  learns  heuristics  for  when  to  apply  par¬ 
ticular  integration  operators  by  determining,  in  an  explanation-based  manner,  why  the  application 
of  a  particular  operator  led  to  the  solution  of  a  specific  problem.  The  formulations  of  LEX2's 
approach  given  in  [MitcheilS6]  and  [Hirsh87]  use  a  set  of  logical  inference  rules  to  prove  that  a  par¬ 
ticular  operator  was  ‘useful"  in  solving  a  particular  problem.  However,  macro-rules  for  solving  a 
particular  class  of  integration  problems  can  also  be  learned  in  an  explanation-based  manner,  and 
the  conditions  that  allowed  each  operator  to  lead  to  the  solution  are  easily  extracted  from  the  gen¬ 
eralized  explanation.  For  example,  consider  solving  the  following  problem  discussed  in 
[MitcheilS6]:  / 3x:dx.  The  solution  to  this  problem  can  be  formulated  as  logical  inference  using  the 
following  inference  rules: 

Cor.stant(?r)  A  ?r^-l  —  Solution(/?x7:d?x.  ?x’r*1/(?r-l)) 

Constant(?c)  A  Solution(/?fd?x.  ?z)  —  Solution(/?c*?fd?x.  ?c*?z) 

Xumber  p(?x)  —  ConstamC*x) 

The  expression  Soiution(?x.?y)  means  ?y  is  the  solution  to  the  problem  °x.  The  proof  EGGS  gen¬ 
erated  as  the  solution  to  the  given  problem  is  shown  in  Figure  4.11.  and  the  generalized  proof  is 
shown  in  Figure  4.12.  The  general  rule  learned  from  this  example  is: 

NumberpUcl)  A  X'umberpC’rl)  A  ?rl^-l  —  Solution(/?cl*0xl’rld?xl.  ?cl*<?xl7rl*1/(?rl-rl))) 

This  rule  can  then  be  used  to  immediately  solve  similar  problems  like:  /5x3dx.  The  conditions 
that  allowed  the  first  operator  to  lead  to  a  solution  are  also  implicitly  representec  in  this  rule.  The 
problem  must  match  the  form  in  the  conclusion:  /7cl*?xl’rld?xl  and  satisfy  the  antecedents  of  the 
rule:  NumberpUcl ).  Numberp(?rl).  and  In  a  problem  with  more  steps,  the  "useful"  con¬ 

ditions  for  other  operators  in  the  solution  can  be  obtaining  by  pruning  fi.e.  removing  from  the 
explanation  structure)  the  steps  for  all  the  previous  operators  in  the  solution  prior  to  generalizing 
the  explanation  structure.  This  will  create  a  macro-rule  for  the  last  n  steps,  and  the  “useful"  con¬ 
ditions  for  the  first  operator  in  this  macro-rule  can  then  be  extracted  as  described  above. 

Unlike  the  process  for  learning  integration  probiem  solving  described  in  [MitcheiiS6j,  this 
method  obtains  both  a  general  macro-rule  and  heuristic  conditions  from  a  singie  explanation  using 
a  single  generalizing  mechanism.  The  methoc  described  m  [Mitche!186]  iearns  only  search 


V.  ■ 


i.'.v.*  y-v\vv  >>\  .\v- vv v>  v\ 
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Solution! /3x'dx.3(x-'1/(2-rl ))) 


Solution! /x2  dx.x2‘1/(2+l)) 


Constant!3) 


Constant(2) 


Xumberp(3) 


Numberp(2) 


Figure  4.11:  LEX2  Example  —  Specific  Explanation 


Solution!  /?cl*?xrrld?xl.?cl*(?xl’rl*1/(?rlTl ))) 


Solution!  /?xrrId?xl.?xl’rl*l/(?rl-l)) 


Constant(?cl) 


’rl^-1 


Constant(?rl) 


Numberp(?cl) 


.\umberp(?rl) 


Figure  4.12:  LEX2  Example  —  Generalized  Explanation 


heuristics  and  uses  the  EBG  method  as  well  as  a  separate  regression  step  using  rules  of  the  domain 
theory  to  regress  operators  through  problem  states.  If  the  problem  solving  operators  are  explicitly 
represented  in  the  rules  of  the  domain  theory,  this  separate  regression  step  is  unnecessary.  Like  the 
approach  presented  here,  the  formulation  given  in  [HirshST],  explicitly  represents  operators  in  the 
domain  rules  in  order  to  eliminate  the  unnecessary  regression  step.  However,  like  Mitchell  et.  at 's 
: ormuiaticn.  Hirsh  s  approach  iearns  only  search  heuristics  for  individual  operators  and  not 
macrc--uies  The  approach  presented  here  learns  both  heuristics  and  macro-rules;  however,  it 
requires  a  specialized  technique  for  extracting  the  heuristics  from  the  generalized  explanation. 


aOu\ 


4.2.5.  Geometry  Example 

Another  mathematical  domain  to  which  EGGS  has  been  applied  is  proving  theorems  in  ele¬ 
mentary  Euclidian  geometry.  J  R.  Anderson  has  used  his  production  system  learning  models  ACT* 
[AndersonS3a.  AndersonS3b]  and  PUPS  [AndersonS7b],  to  model  human  learning  in  this  domain. 
EGGS  is  capable  of  learning  general  geometry  proof  rules  analogous  to  the  productions  produced  by 
Anderson's  composition  process.  These  learned  rules  can  then  be  used  to  solve  similar  problems 
more  efficiently  in  the  future. 

For  example,  consider  the  problem  shown  in  Figure  4.13  which  was  taken  from 
■  AndersonS7b],  The  domain  rules  needed  to  solve  this  problem  are: 

Redexivity:  °x  =  0x 

Side  Side  Side:  ’’a^b  =  ?x°y  A  ?b?c  =  ?y7z  A  ?c?a  =  ?z?x  —  A?a?b?c  =  A?x?v,>z 

Line  Axiom:  ?x?y  =  ?v'1x 

Equality  Axiom:  ?x  =  N  A  7x  =  ?z  —  ?y  =  ?z 

Equality  Axiom:  ?x  =  °y  A  °2  =  ?x  -*  ?z  =  °y 

Given  these  ruies.  EGGS'  backward  chaining  inference  engine  constructs  the  proof  shown  in  Figure 
4.14.  The  generalized  proof  produced  by  EGGS  is  shown  in  Figure  4.15.  The  general  composed  rule 
learned  from  this  problem  solution  is: 

?a6‘,z3  =  ?zrb5  A  7a6?a5  =  7a5‘,b5  -  A?a6?a5?z3  =  A?b57a5°z3 

This  rule  is  analogous  to  the  composed  production  learned  by  ACT*  since  the  simple  method  of 
replacing  each  constant  with  an  independent  variable  produces  the  correct  result  for  this  particular 


GIVEN:  AB  =  BC 
AD  =  DC 
ABC 

PROVE:  AABD  =  ACBD 


Figure  4.13:  Geometry  Learning  Problem 


m 


DA  =  DC 


BD  =  BD 


AB  =  CB 


AD  =  DC 


AD  =  DA 


AB  =  BC 


BC  =  CB 


Figure  4.14:  Geometry  Example  —  Specific  Explanation 


A?a69aS?z3  =  A',bS',a5',z3 


ao  =  ?z3?b5 


">aS?z3  =  ?aS?z3 


",ao,a5  =  ’>b59a5 


'>ao9z3  =  '>z37b5 


',ao7z3  =  ?z3',a6 


7a67a5  =  ’’aS’bS 


"’aS’bS  =  '>b5?aS 


Figure  4.15:  Geometry  Example  —  Generalized  Explanation 


example.  However,  as  many  of  the  other  examples  illustrate,  this  simple  generalization  method 
frequently  produces  overly-specific  generalizations. 

The  rule  learned  by  EGGS  is  capable  of  directly  solving  similar  problems  such  as  the  one 
shown  in  Figure  4.16.  Since  the  solution  to  the  previous  problem  did  not  depend  on  the  bottom 
three  points  being  colinear.  the  general  proof  used  for  the  previous  problem  can  also  be  used  for 
this  problem. 

An  additional  geometry  problem,  shown  in  Figure  4.17  (taken  from  [AndersonS3a]  and  origi¬ 
nally  from  [Jurgenson75]).  is  difficult  enough  so  that  the  inherent  limits  of  the  machine  (a  Xerox 
110S)  prevents  the  EGGS  theorem  prover  from  solving  the  problem.  After  2S  minutes  of  run  time 
with  a  depth  bound  of  nine  (the  depth  of  the  shortest  known  proof),  the  system’s  storage  ;s 
exhausted  and  a  proof  has  still  not  been  found.  However,  understanding  the  following  sketchy 
or  oof  with  a  deoth  bound  of  three  lakes  oniv  7S  seconds  of  CPU  time. 
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GIVEN:  AB-BD 
AC  =  CD 


!  v\ 


PROVE:  A  ABC  =  . 


Figure  4.16:  Geometry  Test  Problem 


£  GIVEN: 

AGB  fge 

/ 

ECA  BDF  DGC 

Vc 

G  is  midpoint  of  AB 

c  G 

G  is  midpoint  of  CD 

\  PROVE: 

G  is  midooint  of  EF 

D  B 

Figure  4.17:  Difficult  Geometry  Example 

Q^-.D  .  Midpoint!  G.EF  ) 

This  example  dearly  demonstrates  the  importance  of  being  able  to  learn  from  hints  and  the 
observed  problem  solving  behavior  of  other  agents.  Explanation-based  learning  from  observation 
allows  a  system  to  learn  how  to  solve  problems  that  it  was  previously  unable  to  solve  due  to 
.nherer.t  computational  resource  limitations. 


4,2.b.  Deductive  STRIPS  Planning  Example 


As  described  :r.  [Xiissor.SO]  and  [Kowalski79 ].  traditional  blocks  world  planning  tan  be  for¬ 
mulated  as  logical  deduction.  The  following  discussion  will  follow  what  Nilsson  relers  to  as 
Kowalski's  formulation"  of  planning  as  deduction.  In  this  formulation,  states  and  actions  are 
treated  as  terms  instead  of  predicates.  Holds(9f.9s)  means  that  the  fact  °f  holds  m  the  state  ?s.  The 
term  doita.'V-  represents  the  state  achieved  by  performing  the  action  ’’a  in  state  ns.  PcsslTs)  means 
tnat  "’s  is  a  possible  state.  ;.e.  one  that  can  be  reached  from  the  current  state.  Pact(',a.'1s)  means 
that  it  is  possible  to  periorm  action  ?a  in  state  ">s.  i.e.  all  of  ’as  preconditions  hold  in  state  9s. 
DiffC’x.?'/)  means  that  °x  and  ?v  are  different,  distinct  terms.  Finally.  Achieve(?f .?s)  means  that 
fact  '’f  is  achieved  in  state  9s.  i.e.  ?f  holds  in  '’s  and  ?s  is  a  reachable  state.  A  set  of  axioms  for  han¬ 
dling  examples  from  the  STRIPS  domain  [Fikes?2]  is  given  below. 

Hoids(?f.?s)  A  Posse’s)  —  Achieve(?f.9s) 

Pactf°a.9s)  7.  PossC^s)  —  Poss(do(9a.?s)) 


Efiec:  Axioms: 

Holds(inroom(9a.?0.do(gothru(9a.?d.?k.?l).?s)) 

Ho  ids(  inroo  m(  ?b.?I )  .co(  pushthru(?a.?b.?d  .9k  .?1 )  .?s) ) 
He  idsf.  in  roo  ta(?a.?I )  .dot  pushthru(?a.?b.?d.nk .?!  ).?s) ) 


F^crne  Axioms: 

Hoids(?v.?s)  •'  DiffCX  ,inroom(?a.?k))  '  Diff(?v.inroom(?b.?k))  — 

Holds1. 'X-. dd  pushthruf  ',a.',b.9d.',k.',l).'’s)) 

Holdsl^vTs)  •  Diffr,v.inroom',?a.?k))  —  Holds('>v.do(gothru(°a.?d.?k.?l).?s)) 


precondition  Axioms: 

Agent  ‘Pi.  '  He  Ics1  inroom(?a.9k).?s)  ■'  Holds(connects('*d.?k.?l).?s)  — 
Pac:igothruraTdTk.T).9s) 

Agent?  9a  *  ‘  Hcids;  inroorm'?a.9k).?s)  P  Hoids(inroom(?b.?k).?s)  A 
Hoidsfcor.nectsi ’’dAk '’s)  -  Pact( pushthruf ?a.'>b.9d.?k.?l).?s) 


s 


Miscciicnentis  Axioms 

A70MCV  ■'  \OT-‘EQ« ’’xTai)  -  Ditf('>x.°a) 


Diffi  x. -  a;  -  Di.T((',x  .  'V).{',a  .  °b  0 
Difft  ?v.?b)  -  DiffiPx  .  '’>•).( '’a  .  ?b)) 

Ho!dsUonnects(?d.?l.?k).?s)  —  Holds!connects(?d.?k.?l).?s) 

Given  the  following  facts  describing  the  initial  state  of  the  world  shown  in  Figure  3.6. 

Poss(SO).  Age.nt(Robot).  Holds! inroomf.Robot. Room  1  ).SO). 

Holds!inroom(Box.Room2).S0).  Holds(conr.ects(Doorl .Rooml .Room2).S0) 

and  the  goal:  Achieveunrooml Box.Rooml).?s).  the  deductive  retriever  constructs  the  proof  shown 
in  Figure  4.18.  The  generalized  proof  for  this  example  is  shown  in  Figure  4.19.  The  explanation  has 
beer,  pruned  to  remove  the  rule:  Not(Eq(?bl.?al4))  A  Atom(?bl.?al4)  —  DiffObl ,?al4).  In  order  to 
increase  generality,  the  pruning  algorithm  removes  instances  of  this  general  rule  that  add  addi¬ 
tional  antecedents  to  the  macro-rule.  The  antecedents  of  the  rule:  Not(Eql  Connects. inroom))  A 
Atom(Connects)  —  DifF(connects.mroom)  are  general  facts  of  the  domain  and  therefore  are  not 
added  to  the  antecedents  of  the  macro-rule.  Consequently,  this  rule  does  not  detract  from  the  gen¬ 
erality  of  the  final  result  and  is  not  pruned.  The  macro-rule  learned  from  the  final  generalized 
explanation  is: 

Poss(?sS)  A  Hoids!connects('>d5.'1k5.'>k3).?sS)  A  Holds(inroom(',al4.!>k5).?s8)  A  Agent(?al4)  A 
Holds(connects(',d3.?!3Tk3).‘1sS)  A  DiffC?bl.?al4)  A  Holds(inroomC1bl.?k3).?sS)  — 

Achieve!  inroom(',blT!3).do(pushthru(!)al4.?bl.?d3.?k3.?13).dc(gothru(?al4.?d5.?k5.?k3).?sS))) 

Notice  that  this  generalization  allows  the  destination  of  the  box  (I’D)  to  be  different  from  the  room 
in  which  the  robot  started  C’k5).  As  discussed  earlier,  a  generalization  produced  by  simply  chang¬ 
ing  each  constant  to  an  independent  variable  would  require  them  to  be  the  same  like  they  were  in 
the  example.  On  the  other  hand,  the  macro-rule  learned  by  EGGS  can  be  used  to  solve  the  following 
problem: 

Gcal:  Achieve! :nroom( 3ox.Room3)Ts) 

Given:  Poss(SO).  Agent(Robot).  Holds(inrocm(Box.Rocm2).S0) 

Holds! inroom! Robot. Room  1  ).S0).  Holds!connects(Doorl  .Rocml  ,Room2).SO) 

Holds! connects!  Door2.Room3.Room2).SO) 

In  addition  to  being  abie  to  generate  plans,  this  formulation  can  also  easily  be  used  to  verify 
that  a  particular  plan  works.  For  example,  the  STRIPS  plan  can  be  verified  instead  of  generated  by 
simply  proving  the  following  assertion: 


Ot'V.  ^V>>>V-7v> /■,  /  ^^>v/.y»vv>N'T 
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1 


Achieve  1 


Poss3 


Pcss5 


Holdsl  7 


Pact-1 


Possl 


Holdsl 


Pac:3  Hole 


Holdsl4  Holdsl6 


✓Agentl  Dirf  2  Holdsl2  Diff6  Holds2 


Holds3 


N'ot2  Atom3 


DiffS 

t 

DifflO 


Noil  Atom2 


Achieve  1  Achieve(inroom(Box,Rooml).do(pushthru(Roboi.Box.Doorl,Room2.Rooml ), 

do!  gothru!  Robot. Door  1  .Room  1  ,Room2  ),SO ) ) ) 
?oss5  Poss! do(  pushthru!  Ro bot. Box. Door  1  ,Room2. Room  1 ). 

do!  gothru!  Robot, Doorl. Room  l.Roo  m2), SO))) 

Possi  Poss!  do(  gothru!  Robot. Door  1  .Room  1  ,Room2  ).S0) ) 

Possl  Poss(SO) 

Pact  3  Pact<gothru!Roboi,Doorl,Rooml.Room2),SO) 

Holds3  Holds!  connects(  Door  1, Room  1  ,Room2),S0) 

Holdsl  Holds!  inroom! Robot. Room  1  ).S0) 

Agent  1  Agent(Rcbot) 

Pact  4  Pact!  push  thru!  Robot,  3ox.  Door  l.Roo  m2.  Room  1 ). 

do!  gothru!  Ro  bo  t. Door  1. Room  l.Roo  m2  ).S0) ) 

Ho  Ids  10  Ho  Ids!  connects!  Door  l.Roo  m2, Room  l  ). do!  gothru!  Robot.Doorl.Rooml.Room2).SOi) 

Diff2  Diff(connects(  Doorl  .Room2, Rooml  ),inroom(Robot,Rooml )) 

Diff4  DifHconnects. inroom) 

Not2  Not!  Eq(connects.inroom)) 

Atomo  Atom!  connects) 

Ho  Ids  12  Holds!  connects!  Door  1  ,Roo  m2, Room  1 ), SO) 

Holds  1-1  Holds!  inroom!  Box.Roo  m2), do!  go  thru!  Robot. Doorl. Room  l.Roo  m2). SO)) 

Di?6  Diff!  inroom!  Box. Roo m2 ). inroom! Robot, Roo ml )) 

Dtff8  Diff!Box(  Room2 ). Robot!  Rooml )) 

DifflO  Diff!  Box. Robot) 

N'otl  Not!  Eq!  Box. Robot)) 

Atom!  Atom!3ox) 

Holds2  Holds! inroom!  8o\\Room2  ).SO) 

Holds  16  Holdsl  inroom!  Robot, Roo  m2  ).do(  gothru!  Robot.  Doorl  .Room  1  .Roo  m2). SO ) ) 

Holdsl  *  Holds!  inroom!  Box,  Room  1  J.do!  pushthru(Robot.Box.Door  1 ,  Roo  m2.  Rooml  ). 

do! gothru!  Robot. Doorl  .Rooml  .Room 2  ).S0) )) 

Figure  4.18:  Logic  STRIPS  Example  -  Specific  Explanation 
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Achieve  1 


Achieve  I 

Achieve!  inroom!  7b  1  .713  ),do(  push  thru!  ?a  1 4,?bl  ,7d3.?k3.7l3 ). 

do(  go  thru(  7a  14.7d5  ,7kS  />k3  ),7s8 ) ) ) 

Poss3 

?oss(do(pushthru(',al4,?bl.?d3.7k3.?13),do(gothru(',al4.7d5,?k5.'’k3),?s8))) 

g 

PossS 

?css(  do(  go  thru(  ’a  1 4,'’d5  ,',k5.7k3 )  ,7s8 )) 

Posso 

Poss(?s8) 

-  , 

Pad  3 

?act(  gothru(  ’a  1 4,?dS  .^kS  ,7k3  ).?s8 ) 

Holdsb 

Holdsl  connects! ',d5,7k5.?k3).‘,s8 ) 

*% 

Holdso 

Holdsl  inroomC’al  4, ’k5  l.^sS ) 

Agen:4 

Agent!  7al4) 

■ 

?ac;4 

Pac:(pushthru!,al4.?bl.?d3.‘’k3.?13).do(gothrtiC,al4,?d5.?k5.?k3).7s8)) 

p 

Hold?  10 

Holdsl  connects!  7d3.,k3.?13).dolgothru(?al4.?d5,?kS,7k3),7$8).1 

Did: 

Difff  connects!  '’d3.''k3.‘,l31.inroom(7al4,7k5)) 

Diff4 

Diffl  connects. inroom) 

Not  3 

Not!  Eql  connects. inroom)  1 

V* 

Atorr.4 

Atomlconnects) 

Ho Ids  12 

Holdsl  connects!  7d3.,k3.7l3  ),ns8 ) 

Holds  13 

Holds!  connects!  7d3.7l3,7k3  l.^sS ) 

Holdsl  4 

Holds!inroomC’bl.?k3),do(gothru(7al4,?d5,?k5.’>k3).'ls8)) 

Diffo 

Diff(  inroom(?bl,?k3),inroom(7al4,?kS)) 

Diff8 

Diff(?bl(7k3).'?al4!‘:k5)) 

%■ 

_  * 

Diff9 

Diffi’bl.?al4) 

% 

V 

Holds  15 

Holdsl  inroom!'lbl.',k3).7s8  ) 

Holdslb 

Holdsl  inroom(',al4,',k31,do!gothrul?al  J,"’d5.',k5.',k3).''s8 )) 

Holdsl  ~ 

Holds!  inroom!  7b  1.713  l.dol  pushthru(7al4,7bl,7d3.’’k3.'>13 ), 

do!  gothru! 'a  14.7d5 .’kS .?k3).1s8  ! ) ) 

Figure  4.19:  Logic  STRIPS  Example  —  Generalized  Explanation 


AchieveCinroomi  Box. Room  1  ).do(  pushihru(  Robot. Box. Door  1  Room  2. Room  1 ). 

do(gothru(.Robot.Doorl  .Rocml  .Room2).S0))) 


Verifying  plans  is  useful  for  constructing  explanations  for  observed  pians  and  thereby  learning 
general  plans  from  observed  behavior. 
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CHAPTER  5 

EGGS:  REWRITE  EXPLANATIONS 
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Many  mathematical  problems  can  be  more  concisely  represented  and  solved  using  rewrite 
’ ~J.es  [BundyS3]  instead  of  logical  deduction.  Domains  that  involve  the  manipulation  of 
mathematical  formulae  generally  require  rewriting  subterms  of  an  expression.  Rewriting  subterms 
using  icgicai  deduction  requires  the  representation  and  skillful  application  of  equality  axioms.  On 
the  other  hand,  the  type  of  equality  reasoning  generally  required  in  most  mathematical  domains  is 
a  natural  part  of  a  mechanism  that  applies  rewrite  rules.  Consequently,  a  rewrite  system  can  be  a 
more  efficient  way  of  solving  particular  types  of  problems.  The  empirical  data  presented  in  chapter 
7  shows  that  the  EGGS  rewrite  system  can  be  about  one  and  a  half  to  four  times  faster  than  the 
EGGS  theorem  prover  at  solving  the  same  problem. 

Like  the  previous  section  on  logical  proofs,  this  section  discusses  the  EGGS  system's  ability  to 
construct,  generalize,  and  learn  macro-rules  from  specific  rewritings.  It  also  presents  a  few  exam¬ 
ples  of  applying  EGGS  to  various  domains  that  use  rewritings  as  explanations. 

5.1.  Rewritings  as  Explanations:  The  LEAP  Example 

Like  logical  proofs,  chains  of  rew-rite  rule  applications  can  be  represented  as  explanations. 
Each  rewritten  expression  is  a  pattern  in  the  explanation  and  the  chaining  together  of  two  rules  is 
represented  by  a  unification  between  the  right-hand-side  (RHS)  of  the  first  rule  and  the  left-hand- 
side  (LHS)  of  the  second.  For  example,  consider  solving  the  LEAP  example  discussed  in  section 
4.2.2  by  rewriting  the  expression  -(-’(aVb)V-(cVd))  to  (aVb)A(cVd)  using  the  following  rewrite 
rules: 

--Gx  —  ?x 

-Ax  7  ?y )  -  -?X  A  ->?y 

The  explanation,  explanation  structure,  and  generalized  explanation  for  this  example  are  shown  in 
Figure  5.1,  Figure  5.2.  and  Figure  5.3  respectively.  Notice  that  if  a  rule  rewrites  only  a  subterm  of 
an  expression,  "dummy  variables"  are  added  to  the  pattern  in  the  explanation  structure  to  fill  out 
the  expression  so  that  it  can  be  unified  with  the  previous  expression.  LISP  list  notation  is  used  in 
the  illustration  of  the  explanation  structure  since  the  technique  used  to  add  dummy  variables  relies 
on  the  underlying  list  representation.  For  example,  after  applying  De.Morgan's  Law  in  the  LEAP 
problem,  the  rule  for  eliminating  a  double  negation  is  used  to  rewrite  the  first  term  in  the  resulting 


*'( -,(av'b)v'->(cVd)  i 


->(-'(aVb))A-.(-(cVd)) 


s 


(aVb)A-i(-(c\/d)) 


(aVb)A(cVd) 


Figure  5.1:  LEAP  Rewrite  —  Specific  Explanation 
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-(-■(?f4)v-(?x3 )) 


->(-«(  ?f  4 )  )A-(  -( ?x3 ) ) 


?f4A-(  -«( ?x3)) 


7f4/.',x3 


Figure  5.3:  LEAP  Rewrite  —  Generalized  Explanation 


conjunction.  In  order  to  allow  the  LHS  of  this  rule  to  unify  with  the  RHS  of  the  instance  of 
DeMorgan's  Law  in  the  explanation  structure,  this  rule  is  padded  with  the  dummy  variables  ?fl 
and  ?f 2.  During  generalization,  this  unification  results  in  the  following  substitution: 

(and/?fl.  not(?x2)/?xl.  (not(,yl))/'>f2} 

Next,  the  rule  for  eliminating  a  double  negation  is  used  to  rewrite  the  second  term  in  the  conjunc¬ 
tion  and  this  instance  of  the  rule  is  padded  with  the  dummy  variables  ?f 3 .  ?f4,  and  ?f 5  in  order  to 
allow  its  LHS  to  unify  with  the  previous  instance.  During  generalization,  this  unification  results  in 
the  following  substitution: 

iand/?f3.  ?f 4/7x2.  noi(?x3)/?yl.  ML.'?f5| 

Applying  the  composition  of  these  two  substitutions  to  the  explanation  structure  results  in  the  gen¬ 
eralized  explanation  shown  in  Figure  5.3. 

5.2.  Facilities  in  EGGS  Supporting  Rewrite  Explanations 

Figure  3.4  illustrates  the  components  in  EGGS  that  use  and  learn  rewrite  rules.  The  rewrite- 
rule  engine  allows  EGGS  to  construct  rewrite-rule  explanations  by  solving  problems  itself.  These 
explanations  can  then  be  generalized,  resulting  in  macro-rules  that  can  be  used  to  solve  similar 
problems  more  efficiently.  A  system  for  understanding  sketchy  rewritings  provided  by  the  user 
would  also  allow  the  system  to  operate  as  a  learning  apprentice.  Although  the  current  EGGS  sys¬ 
tem  is  not  equipped  with  such  a  sub-system,  it  would  not  be  difficult  to  construct  one.  An  example 
of  such  an  understanding  system  is  discussed  in  [BennettS6l. 
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5.2.1.  Rewriting  Expressions 


There  are  a  number  of  techniques  for  using  rewrite  rules  to  solve  problems.  If  a  set  of 
rewrite  rules  is  canonical  [Bundv83].  any  sequence  of  legal  rewrite  rule  applications  will  eventu¬ 
ally  convert  an  expression  to  canonical  form.  If  a  rule  set  is  not  canonical,  it  is  necessary  to  search 
through  the  space  of  possible  rewritings  until  an  expression  is  found  that  meets  a  specified  criterion. 


EGGS  is  equipped  with  both  a  deterministic  rewrite  system  for  applying  canonical  rule  sets 
and  a  backtracking  rewrite  system  for  rule  sets  that  are  not  canonical.  The  deterministic  system 
simply  applies  a  rule  to  rewrite  the  current  expression  or  one  of  its  sub-expressions.  This  process  is 
continued  until  no  more  rules  apply.  The  backtracking  system  searches  possible  rewritings  in 


either  a  depth-first  or  breadth-first  manner  until  a  supplied  goal  predicate  is  satisfied. 


[n  both  cases,  there  are  various  ways  of  choosing  which  rule  to  apply  at  each  step.  For  the 
deterministic  system,  this  choice  determines  the  length  of  the  eventual  path  to  the  canonical  form. 
For  the  backtracking  system,  this  choice  also  effects  the  amount  of  search  required.  In  his  book. 
Bundy  presents  two  methods  for  choosing  which  expression  to  rewrite.  The  first  is  to  rewrite  the 
leftmost/ innermost  subexpression  that  can  be  rewritten.  This  method  is  called  innermost  or  call  by 
value.  The  second  is  to  rewrite  the  leftmost/outermost  subexpression  that  can  be  rewritten.  This 


method  is  called  outermost  or  call  by  name.  In  order  to  allow  learned  macro-rules  to  have  priority 


over  solving  problems  from  scratch,  a  third  technique  called  rule  ordered  is  used  in  EGGS.  In  this 
technique,  the  rewrite  rules  are  ordered  and  the  sub-expression  that  matches  the  LHS  of  the  the 


highest  numbered  ruie  is  rewritten. 


The  three  examples  presented  in  this  section  all  use  the  deterministic  rewrite  system.  Com¬ 
plete  rule  sets  capable  of  solving  any  problem  in  the  domains  discussed  would  not  be  canonical  and 
would  require  using  the  backtracking  system.  For  the  LEAP  example  discussed  above,  the  deter¬ 
ministic  rewrite  engine  produces  the  rewriting  shown  in  Figure  5.1. 

5.2.2.  Learning  Rewrite  Macro-Rules 

After  the  solution  to  a  specific  rewriting  problem  is  determined  by  the  rewrite  rule  engine,  the 
EGGS  generalization  algorithm  is  used  to  produce  a  generalized  explanation  like  that  shown  in  Fig¬ 
ure  5.3.  A  rewrite  macro-rule  is  easily  extracted  from  the  generalized  explanation  by  taking  the 
first  expression  in  the  chain  as  the  LHS  and  the  last  expression  as  the  RHS.  The  new  rewrite 
macro-rule  that  is  learned  from  the  LEAP  example  is: 

-(->?f4V-?x3)  -  ?f4A?x3 

A  learned  macro-rule  is  added  to  the  beginning  of  the  list  of  rewrite  rules  so  that,  due  to  the 
rule  ordered  method,  it  is  tried  first  when  solving  subsequent  problems.  As  a  result,  performance  is 
improved  on  subsequent  problems  that  can  be  solved  using  the  macro-rule.  However,  as  with  logi¬ 
cal  inference  macro-rules,  unsuccessfully  trying  to  apply  the  new  rule  will  slightly  degrade  perfor¬ 
mance  on  other  problems.  Some  empirical  data  on  the  effect  of  learning  on  rewriting  performance  is 
given  in  chapter  7. 


5.3.  Rewrite  Examples 

This  section  presents  two  more  examples  of  rewrite-rule  explanations  m  addition  to  the  LEAP 
example  discussed  above.  They  include  a  rewrite  formulation  of  the  LEX  exampie  presented  earlier 
and  an  example  from  the  domain  of  solving  algebraic  equations. 


5.3.1.  Rewrite  Version,  of  the  LEX2  Example 

Examples  from  LEX2's  domain  of  solving  integrals  can  also  be  more  concisely  formulated  as 
rewritings.  Consider  the  exampie  discussed  earlier  of  solving  the  integral:  fix2.  The  following 
two  rewrite  rules  are  needed  to  solve  this  problem: 

f'/'c^f )d°x  —  1c*/?f?d'7x  if  -OccursIn(?x.?c) 

fix  rd?x  —  nx’r"1  '(°r-l )  if  ?r^-l  h  -,OccursIn(?x.?r) 

The  if  conditions  attached  to  the  rewrite  rules  are  additional  conditions  that  must  be  met  for  the 
rewrite  ruie  to  apply  The  specific  and  generalized  explanations  generated  for  this  problem  by  the 
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EGGS  rewriting  system  and  generalizer  are  shown  in  Figure  5.5  and  Figure  5.6  respectively.  The 
rewrite  macro-ruie  learned  from  this  example  is: 

/( 0 f 3 * ( ? x 2 ’ r  1 ) ) d ° x 2  -  ?f3«((?x2’rl-I)/(?rl+l)) 

if  -OccursIn(?x2.?f3)  A  ?rl  A  -Occursln(?x2.'>rl ) 

This  rule  can  then  be  used  to  solve  additional  problems  such  as  /5x3  more  efficiently. 

5.3.2.  Equation  Solving  Example 

EGGS  has  also  been  tested  in  the  domain  of  solving  algebraic  equations.  The  LP  system 
LSil verS 3 .  SiKerSb]  learns  by  analyzing  individual  solutions  to  equations:  however,  it  learns  very 
abstract  "strategies"  which  are  not  guaranteed  to  solve  problems  to  which  they  can  be  applied. 


/(7*(x:))dx  "OccursInCx.?) 


7«/TF)dx 


7»((x2*l)/(2*l)) 


-OccursIn(x.2) 


Figure  5-5:  LEX2  Rewrite  Example  —  Specific  Explanation 


/(”  f3*(°x2’rI))d^x2 


-OccursIn(?x2.',f3) 


7f3V'f?xrr‘)d7x2 


’’rl^-1  -Occursln(',x2.?rl ) 


?f3*((?x2’rl-1)/(°rl*lj) 


Figure  5.6:  LEX2  Rewrite  Example  —  Generalized  Explanation 


/VVvV? 


V*. 

i  i* 


These  strategies  are  then  executed  in  a  flexible  manner  when  attempting  to  solve  future  problems. 
EGGS,  on  the  other  hand,  learns  more  specific  rewrite  rules  which  are  guaranteed  to  solve  a  particu¬ 
lar  class  of  problems. 

The  background  knowledge  used  in  this  domain  is  also  most  conveniently  given  in  terms  of 
rewrite  rules.  The  following  set  of  rules  can  be  used  to  solve  the  equation:  logt(x-rl)  *  loge(x-l)  -  c 
(from  [BundyS3]). 

log,b?u»'>v  —  ?u«,b,v  if  OccursIn(x.?u)  A  NotOccurIn(x.?v) 

',u-',v=',2  —  ?u**?z-r?v  if  OccursIn(x.°u)  A  NotOccurIn(x.?v) 

?u2=?v  —  ‘1U“Sqrt(?v)  if  OccursIn(x.?u)  A  \otOccurIn(x.?v)) 


5 

?u  2-?v  —  '7u=sq 

</ 

1 . 

(?u— ?v)*(?u-,’v) 

v: 

log,v.‘?u-iog,^?v 

The  if  conditions  attached  to  the  rules  constitute  control  information  which  assure  that  the  applica¬ 
tion  of  a  rule  is  a  step  towards  the  solution.  This  control  information  forms  a  part  of  the  explana¬ 
tion  and  is  learned  along  with  the  body  of  the  new  compiled  rewrite  rule.  These  conditions  are  pro¬ 
ven  deductively  using  the  EGGS  theorem  prover  and  the  following  inference  rules: 

1)  OccursIn(7x.?x) 

2)  OccursIn(',x.?y)  —  OccursIn(?x.(?v  .  ?z)) 

31  OccursIn(?x.?z)  —  OccursInC,x.(?v  .  ?z)) 

4)  Atom(?y)  A  -Eq(?x.?y)  —  ,\otOccurIn(?x.?y) 

5)  NotOccurIn(?x.?y)  A  NotOccurIn(?x.?z)  —  NotOccurIn(?x.(?y  .  ?z)) 

The  specific  explanation  for  the  solution  to  sample  equation  is  given  in  Figure  5.7.  If  this  explana¬ 
tion  is  generalized  without  pruning,  the  following  rule  is  extracted  from  the  generalized  explana¬ 
tion: 

log.bl(  x-''v3)-log,b[(x-?v3)-'>v2  —  x«sqrt((',brv:)-i-(?v3::)) 

if  AtomC’bl)  A  -Eq(x.?bl)  '■  Atom(?v2)  A  ->Ea(x.?v2)  A  Atom(?v3)  A  -Eq(x.?v3) 

Maintaining  the  complete  structure  of  the  original  explanation  has  required  ?bl.  ?v2.  and  ?v3  to  be 
atoms  as  they  were  in  the  example.  That  is.  in  the  example,  it  was  proved  that  ?bl  (the  base  of  the 
logarithm )  did  not  contain  x  because  it  was  an  Atom  that  was  not  Eq  to  x.  To  increase  the  general¬ 
ity  of  learned  macro-rules,  a  special  pruning  procedure  for  this  domain  automatically  prunes 
instances  of  inference  rules  1  and  4  from  the  explanation  structure.  The  generalized  explanation 
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produced  trcm  ihe  pruned  explanation  is  shown  m  Figure  0.6  Tne  r.e~  rewrite-ruie  learned 
lh:s  generalized  explanation  is: 

log,bl(,',yS-',v3)*log,bl('>yS-'>v3)=0y2  —  ?vS=sqrt((?brv:)-C?v32)) 
if  \otOccurIn(x.?bl)  A  NotOccurIn(x.?v2)  A  XotOccurlnCxTvS)  A  OccursIn(x.?yS) 

This  rule  is  at  the  appropriate  level  of  generality  to  cover  a  relatively  large  class  of  problems 
example,  this  rule  can  now  be  used  to  solve  the  following  test  example  in  one  step. 

log3..(x*2*a)  -  log3.?(x-2*a)=c-d 


x=sart((3*e)c'd-(2*a);:) 


Occursln  ”  SotlQ  Atomll  Not9  AiomlO 
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Figure  5.7:  Equation  Solving  Example  —  Specific  Explanation 
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Figure  5.8:  Equation  Solving  Example  —  Generalized  Explanation 
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EGGS:  PLAN  EXPLANATIONS 
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Although  constructing  and  verifying  plans  can  be  accomplished  using  logical  deduction  as 
described  in  section  4.2.6.  other  representations  and  algorithms  are  generally  more  concise  and 
efficient  [ChapmanS7.  Fikes72.  Sacerdoti74.  Sacerdoti77.  Waldinger77],  For  example,  verifying  the 
following  simple  STRIPS  plan  took  ten  times  as  much  CPU  time  using  the  EGGS  deductive  retri¬ 
ever  compared  to  a  more  specialized  algorithm  (discussed  later  in  this  section)  for  verifying  plans 
using  STRIPS  operators  (44s  versus  4.4s). 

Given.  InRoom(Robot.Rooml).  InRoom(Box,Room3).  Connects(Doorl.Rocml.Room2) 

ConnectsC  Door2  ,Room2  .Room3 ) 

Goal:  InRoom(Box.Room2) 

Plan:  GoThru(Robot. Door  1. Room l.Room2).  GoThru(Robot.Door2.Room2.Room3). 
PushThru(Robot.Box.Door2.Room2.Room3) 
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In  plan-based  domains,  it  is  generally  more  efficient  to  use  representations  especially  developed  for 
representing  actions.  This  section  discusses  the  EGGS  system's  ability  to  understand,  generalize, 
and  learn  macro-operators  from  plan-based  explanations  and  presents  a  couple  of  detailed  examples 
of  these  processes. 

6.1.  Plans  as  Explanations:  The  STRIPS  Example 

The  representation  of  actions  ’used  in  EGGS  for  pian-based  explanations  is  a  slight  variation  of 
STRIPS  operators  [Fikes72].  Actions  have  preconditions .  facts  that  must  be  true  in  order  to  perform 
the  action,  and  effects,  facts  that  are  true  after  the  action  is  performed.  Both  cdd  and  delete  items 
are  included  in  the  effects:  delete  items  simply  have  an  explicit  negation.  Existing  facts  that  are  not 
explicitly  negated  by  the  effects  of  an  action  continue  to  be  true.  Definitions  of  the  actions  used  in 
the  STRIPS  exampie  are  shown  in' Table  6.1.  The  departure  from  standard  STRIPS  operators  which 
use  separate  adc  and  delete  lists  allows  a  distinction  to  be  made  between  facts  that  are  know-n  to  be 
false  and  facts  that  are  simply  not  known  to  be  true.  This  is  because  facts  that  are  known  to  false 
'..e.  deleted  facts)  are  explicitly  asserted  as  negations  in  the  database.  Tius  in  turn  allows  for  nega¬ 
tive  preconditions  that  can  check  if  a  fact  .s  known  to  be  faLse.  and  allows  positive  effects  to 
oirectly  delete  negative  facts  in  the  database. 


Table  6.1:  STRIPS  Actions 


) 


Action  i  Preconditions 

Effects 

GoThru(9a.?d.?rl.?r2)  J  InRoom(?a.',rl ) 

!  Connects(?d.?rl ,?r2) 

InRoom('1a.7r2) 
-InRoom(?a.?rl ) 

PushThru(?a.?o.7d.?rl.?r2)  \  In.Room(?a.‘.’rl ) 

InRoom(7o.7rl ) 
Connects(7d.?rl  .?r2) 

i 

InRoom(',a.?r2)  ! 
InRoom(7o.?r2) 
-iInRoom(',a.7rl) 
'InRoomC1b.?rl ) 

Individual  actions  and  Horn-clause  rules  form  the  units  of  an  explanation  and  explanations 
are  connected  sets  of  actions  and  rules  in  which  preconditions  and  antecedents  are  equated  to  effects 
and  consequents.  The  specific  explanation,  explanation  structure,  and  generalized  explanation  for 
the  STRIPS  example  discussed  earlier  are  shown  in  Figure  6.1.  Figure  6.2.  and  Figure  6.3  respec¬ 
tively.  In  addition  to  action  definitions  given  in  Table  6.1.  this  example  makes  use  of  the  following 
inference  rule:  Conr.ects(?d.?rl.?r2)  —  Connects! ?d.?r2.?rl). 

6.2.  Facilities  in  EGGS  Supporting  Plan  Explanations 

Figure  6.4  illustrates  the  the  components  in  EGGS  that  use  and  learn  plans.  The  complete 
EGGS  system  is  equipped  with  a  general  purpose  sub-system  for  verifying  or  "understanding" 
plans  and  a  technique  for  obtaining  general  macro-operator  definitions  from  generalized  explana¬ 
tions.  The  current  system  does  r.ot  have  a  planner  for  independently  generating  its  own  plans: 


'InRoom(Robot.Room2)  InRoom(Box,Rooml)  -■InRoom(Box,Room2)  InRooml  Robot.  Room  1) 

PushThru(Robot,Box.Doorl.Room2,Rooml)  ,  _  „  , 

— — -r~ — -  -InRoom(Robot.Rooml) 


Connects(Doorl,Room2,Rooml)  InRoom(Box,Room2)  InRoom(Robot.Room2> 


GoThru(Robot.Doorl,Rooml.Room2) 

- - n 

_ _ — - - —  i 

Connects(Doorl,Rooml,Room2)  InRoomf  Robot. Room  1 ) 

Figure  6.1:  STRIPS  Example  —  Specific  Explanation 
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-•lnRoom(',a2,',xl2)  InRoom(9bl,',y  12) 


-lnRoom(7bl,',xl2) 


InRoom(?a2,?y  12) 


PushThru(?a2.?bl.?d2.?xl2.‘>yl2) 


Con.nects('Jd2,?xl2,’>y  12)  lnRoom(?bl,'>xl2)  InRoom(',a2.’>xl2) 


Connects('M3,?r  l,?sl) 
A 


InRoom(’>al.',y9)  -InRoom(?a  l,',x9) 


Connect^  7d3,?sl.?r  1) 


GoThruC,al.'Ml.?x9,'7y9) 


Connected  1.7x9, ?y9) 


InRoom<?a  1.7x9) 


Figure  6.2:  STREPS  Example  —  Explanation  Structure 


InRoom(?a2,?xl2)  InRoom(?bl,?yl2)  -InRooin(?bl,?xl2)  InRoom(?a2.?yl2) 


PushThru(?a2,?bl,?d2,?xl2,?y  1 2 ) 


Co  nnects(  7d  2,?x  1 2,?y  1 2) 

f 

i 

Connects(',d2,?y  12,7x1 2) 


InRoom(?bl,?xl2)  InRoom(?a2,?xl2)  ->InRoom(?a2,?x9) 


GoThru*  ?a2,?d  1  ,,x9,?x  12) 


ConnectsPd  I,?x9,'>xl2) 


InRoom(?a2.?x9) 


Figure  6.3:  STRIPS  Example  —  Generalized  Explanation 
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however,  there  are  many  well-known  planning  algorithms  which  could  be  used  for  this  task 
[ChapmanST.  Fikes72.  Sacerdoti74.  Sacerdoti77.  Waldinger77]. 

6.2.1.  Plan  Verification 

Verifying  a  plan  is  the  task  of  constructing  a  complete  explanation  for  a  sequence  of  actions. 
This  process  may  also  be  referred  to  as  '‘explaining”  or  "understanding”  the  plan.  An  example  of 
this  task  is  generating  the  explanation  shown  in  Figure  6.1  given  only  the  following  actions: 

GoThru(Robot.Docrl  .Rooml.Room2).  PushThruCRobot. Box. Door  1  .Room2.Rooml) 

In  addition  to  a  set  of  actions  performed,  facts  describing  the  initial  state  of  the  world  and  a  fact 
describing  the  desired  state  of  the  world  (the  goal)  may  also  be  provided.  The  verification  process 
attempts  to  causally  connect  these  individual  actions  and  states  into  a  explanation  that  supports  the 
goal  (if  provided).  For  the  STRIPS  blocks  world,  the  process  resembles  understanding  a  simple 
"story"  about  a  robot's  actions. 

The  basic  algorithm  used  in  EGGS  to  explain  simple  robot  plans  is  outlined  in  Figure  6.5.  The 
function  Retrieve! x I  uses  the  EGGS  deductive  retriever  (described  in  section  4.1.1)  to  try  to  infer 
the  wff  x.  If  successful.  Retrieve  asserts  the  retrieved  fact  in  the  database  along  with  its  dependen¬ 
cies  and  returns  the  retrieved  fact.  The  function  Assert  adds  a  fact  in  the  database  and  the  func¬ 
tion  Delete  removes  a  fact  from  the  database. 

The  explanation  procedure  is  given  a  chronologically  ordered  list  of  facts  representing  states 
and  actions.  If  an  input  is  a  state,  such  as  InRoom(Box.Rooml ).  and  it  cannot  be  inferred  from 
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for  x  in  the  list  of  input  wffs  do 
if  x  is  a  state  and  -Retrieve! x) 
then  Assert(x) 
if  x  is  an  action 
then 

Assert(x) 

for  each  precondition  p  of  x  do 
let  s  =  Retrieve(p) 
if  s 

then  equate  p  to  s 
else  Assert(p) 
for  each  effect  e  of  x  do 
Assert(e) 

let  d  =  Retrieve! -e) 
if  d  then  Delete(d) 
if  goal  is  known  and  Retrieve(goal) 

then  generalize  explanation  supporting  goal 

Figure  6.5:  Basic  Plan  Verification  Algorithm 


what  is  already  known,  then  it  is  sixnplv  asserted  in  the  database.  If  an  input  is  an  action,  then  the 
procedure  first  tries  to  infer  that  its  preconditions  are  met.  If  a  precondition  cannot  be  inferred, 
then,  since  the  action  is  known  to  have  taken  place,  it  is  assumed  to  be  true  and  asserted  in  the 
database.  Next,  the  effects  of  the  input  action  are  asserted  and  any  existing  facts  that  directly  con¬ 
tradict  them  are  deleted  from  the  database.  After  every  input,  if  a  goai  has  been  specified,  then  the 
system  tries  to  infer  that  the  goal  has  been  met.  When  the  goal  has  been  achieved,  the  resulting 
explanation  is  generalized. 

As  an  example  of  this  process,  consider  the  standard  STRIPS  example  mentioned  above.  In 
addition  to  the  two  actions,  assume  the  system  is  given  that  the  goal  is  InRoomCBox.  Rooml). 
Given  the  input  GoThru(Robct.  Doorl.  Rooml.  Room2).  the  preconditions  InRoomCRobot.  Rooml) 
and  Connects! Doorl.  Rooml.  Room2)  cannot  be  inferred  but  can  be  consistently  assumed,  so  they 
are  asserted  to  be  true.  The  effects  of  the  GoThru:  InRoomCRobot.  Room2)  and  -InRoomCRobot. 
Rooml).  are  then  asserted.  The  second  effect  causes  InRoomCRobot.  Rooml)  to  be  deleted.  The  sys¬ 
tem  then  tries  to  infer  that  the  goal.  InRoomCBox.  Rooml).  is  satisfied  and  fails.  Next,  the  input 
PushThruCRobot.  Box.  Room2.  Rooml)  is  considered.  The  precondition  InRoomCRobot.  Room2) 
was  aireadv  asserted  as  an  effect  of  the  Gothru  and  the  precondition  Connects! Door  1 .  Room!. 


Rooml)  can  be  inferred  from  ConnectsfDoerl,  Rooml  Room2).  The  precondition  InRoomCBox. 
Room2)  is  assumed  as  a  necessary  precondition  for  the  action.  The  effects.  InRoomC  Robot.  Rooml ). 
InRoomCBox.  Room2).  -InRoomC Robot.  Room2).  and  -'InRoomCBox.  Room2)  are  asserted  and  the 
existing  InRoom  facts  are  deleted.  The  goal  is  then  checked  again  and  this  time  is  successfully 
retrieved.  The  resulting  explanation,  shown  in  Figure  6.1.  is  then  generalized  to  produce  the  gen¬ 
eralization  shown  in  Figure  6.3. 

The  basic  understanding  process  presented  here  will  only  work  on  relatively  simple  plans  in 
which  every  action  is  explicitly  mentioned.  Later,  in  chapter  10.  this  procedure  is  elaborated  in 
order  to  allow  it  to  construct  explanations  for  natural  language  narratives.  .Nevertheless,  the  sim¬ 
ple  procedure  presented  here  forms  an  important  part  of  the  understanding  system  in  GENESIS. 

6.2.2.  Learning  Macro-Operators 

In  order  to  be  useful  for  later  planning,  the  generalized  explanation  must  be  converted  into  a 
definition  for  a  new  action  (a  macro-operator)  which  summarizes  the  preconditions  and  effects  of 
the  combination  of  actions.  The  macro-operator  learned  from  the  STRIPS  example  is  shown  in 
Table  6.2.  STRIPS  used  triangle  tables  to  efficiently  store  macro-operators  for  all  sub-sequences  of 
the  original  plan.  Currently.  EGGS  only  builds  macro-operators  for  complete  sets  of  actions.  How- 
ex  er.  unlike  STRIPS.  EGGS  addresses  the  case  in  which  the  actions  in  the  plan  are  only  partially 
ordered  instead  of  completely  ordered.  The  additional  goal  is  to  find  the  minimally  constrained 
partial  ordering  of  the  actions  that  allows  them  to  be  connected  as  they  were  in  the  example  (i.e. 
maintaining  the  unifications  between  preconditions  and  effects). 

Compared  to  a  macro-rule  learned  by  applying  EGGS  to  a  deductive  formulation  of  planning 
las  in  section  4.2.6).  a  macro-operator  can  be  more  informative  and  more  general.  A  macro-rule 
like  that  presented  in  section  4.2.6  specifies  only  one  effect  of  a  combination  of  actions  while  a 
macro-operator  specifies  ail  of  the  effects.  In  addition,  a  macro-rule  for  a  proof  in  situation  cal- 
cuius  cannot  represent  partially  ordered  actions.  Re-ordering  actions  in  such  a  formulation 
requires  completely  altering  the  structure  of  the  existing  explanation.  Consequently .  generalizing 
the  order  of  actions  in  such  a  formulation  would  be  much  more  difficult. 

Building  a  macro-operator  requires  determining  the  overall  preconditions  and  effects  of  the 
general  plan.  The  leaves  of  the  generalized  explanation  form  a  set  of  preconditions  for  the  macro¬ 
operator  since  they  represent  preconditions  of  actions  in  the  plan  that  are  not  achieved  by  previous 
actions.  For  the  STRIPS  example,  the  first  four  preconditions  from  Table  6.2  are  the  leaves  of  the 
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Table  6.2:  Macro-Operator  Learned  from  the  STRIPS  Example 
GoThruPushThru(',a2.7dl.'>x9.',bl.?d2.',xl2.',vl2) 


Preconditions 

1  Effects 

lnRoom(7a2.?x9) 

InRoom(?bl.?v  12)  [ 

InRooxn(?bl.?xl2) 

lnRoom(?a2.?v  12)  I 

1  Connects(9dl.?x9.?xl2) 

-,lnRoom(?a2.7xl2)  j 

ConnectsOd2.?v  12.7x1 2) 

-HnRoomC’bl  .7x12)  ! 

1  -('>a2-',bl  ^  ?x9-?xl2) 

0yl25e°x9  —  ->InRoom(',a2.‘7x9) 

generalized  explanation  shown  in  Figure  6.3.  The  hnal  precondition  m  the  table  is  added  later  and 
is  discussed  in  the  following  section. 

The  effects  of  the  overall  plan  are  more  difficult  to  deterrrnne  since,  as  noted  in  [Fikes72  (sec¬ 
tion  4.3)].  the  deletions  of  facts  that  occur  m  general  may  not  be  the  same  as  the  deletions  that 
occurred  in  the  specific  instance.  To  illustrate  this  point,  they  use  the  following  simple  plan  for 
pushing  two  boxes  to  two  separate  locations:  Push(Boxl.  Loci),  Pusb(Box2.  Loc2).  The  initial  gen¬ 
eralization  for  this  plan  is:  Push(?bl.  ?il).  Push(?b2.  '’12)  which  has  the  effects:  At(?bl.  ?ll). 
At(?b2.  ?12).  However,  if  ?bl  and  ?b2  are  instantiated  to  the  same  box  and  ?11  and  ?12  are  instan¬ 
tiated  to  distinct  locations,  this  plan  miraculously  achieves  the  goal  of  having  the  same  box  be  in 
two  places  at  once.  The  problem  is  that  in  certain  situations  the  actions  in  a  macro-operator  may 
delete  facts  that  were  not  deleted  in  the  specific  example.  Consequently,  in  order  to  determine 
which  facts  are  still  true  at  the  end  of  the  execution  of  the  macro-operator,  one  must  determine  all 
deletions  that  might  occur  in  the  general  case. 


6.2.2. 1.  Determining  Deletions  and  Preventing  Protection  Violations 

Determining  possible  deletions  is  discussed  in  [Fikes72  (section  4.3)]:  however,  the  problem  is 
more  complicated  when  one  considers  partially  ordered  actions.  Most  ordering  constraints  in  a  plan 
are  imposed  by  one  action  achieving  a  precondition  for  another  action.  These  orderings  are  captured 
by  the  explanation  structure:  an  action  must  precede  another  action  if  it  eventually  supports  it. 
Any  topologically  sorted  [Reingold77.  SedgewickS3]  list  of  the  actions  in  the  generalized  explana¬ 
tion  graph  will  be  an  ordering  that  satisfies  these  constraints.  However,  some  of  these  orderings 
may  result  in  protection  violations  [CharniakS5.  Sussman73]  in  which  an  action  deletes  or  clobbers  a 
goal  or  a  precondition  for  a  later  action.  When  examining  deletions  in  a  macro-operator,  it  is 
important  to  notice  possible  protection  violations  and  impose  additional  ordering  constraints  to 
avoid  them. 
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Figure  6.6  describes  the  algorithm  EGGS  uses  to  determine  deletions  for  the  generalized  expla- 
r.at.on  and  impose  additional  ordering  constraints  to  prevent  protection  violations.  In  the  algo¬ 
rithm.  Support <•?/£.  bl  refers  to  a  function  that  returns  true  iff  c  eventually  supports  b  in  the  gen¬ 
eralized  explanation,  i.e.  if  there  is  a  directed  path  from  a  to  b  in  the  explanation  graph.  For  each 
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action,  the  algorithm  determines  the  set  of  states  in  the  explanation  that  could  be  true  before  the 
action  is  executed  based  only  on  the  partial  ordering  of  actions  imposed  by  the  explanation 


for  each  action  a  in  the  generalized  explanation  do 
let  deietable  =  0 

for  each  state  s  in  the  generalized  explanation  do 

if  -Supports?(a.  s)  then  let  deietable  =  deietable  U  Is! 
for  each  effect  e  of  a  do 

for  each  state  d  in  deietable  do 
if  -d  unifies  with  e 
then 

let  a  be  the  conditions  under  which  ->d  and  e  unify 
let  c  be  the  action  of  which  d  is  an  effect  (possibly  NIL) 
if  null(c)  V  Supports?(c.  a) 
then 

if  d  is  the  goal  or  supports  a  proof  of  the  goal 
then  add  -<<*  to  the  preconditions 
else  d  is  deleted  by  e  when  a  is  true 

else 

if  d  is  the  goal  or  supports  a  proof  of  the  goal 
then  a  must  precede  c  when  at  is  true 
else  d  is  deleted  by  e  when  a  is  true  and  c  precedes  a 
PreventPrecondmonClobberingCa.  d.  c.  a) 
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procedure  PreventPreconditionClobberingfa.  d.  c.  a) 

Add  conditions  to  prevent  a  from  clobbering  precondition  d  when  a  is  true  *) 
for  each  action  b  such  that  d  is  a  precondition  of  b  or  supports  a  proof  of  a  precondition  of  b  do 
if  Supports'i’U.  b) 
then 

if  null(c)  V  Supports?(c.  a) 

then  add  - at  to  the  preconditions 
else  a  must  precede  c  when  a  is  true 

else 

if  null(c)  V  Supports'Xc.  a) 

then  b  must  precede  a  when  a  is  true 

else  a  must  precede  c  or  b  must  precede  a  w  hen  a  is  true 
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Figure  6.6:  Algorithm  for  Determining  Deletions  and  Preventing  Protection  Violations 
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structure.  These  are  called  the  deferable  states  since  they  are  ones  that  the  action  could  possibly 


delete.  For  each  of  these  states,  the  condition  (a)  under  which  the  action  could  delete  the  state  m 
general  are  determined.  For  example,  if  an  effect  of  an  action  is  -'InReom(7x.  ?y),  then  the  state 
InRoom(?a.  ?b)  is  deleted  iff  ?x=7 a  and  ?y=?b.  If  the  deletable  state  is  not  supported  by  another 
action  (i.e.  it  is  a  precondition  of  the  macro-operator)  or  if  it  is  an  effect  of  an  action  that  must  pre¬ 
cede  the  deleting  action  because  it  supports  it.  then  the  state  must  be  true  before  the  action  is  exe¬ 
cuted  and  it  will  be  delete  1  when  a  is  true.  If  the  deletable  state  is  also  the  goal  or  supports  a 
proof  of  the  goal,  then  -a  is  added  to  the  preconditions  to  prevent  the  goal  from  being  clobbered. 
Otherwise  it  is  marked  as  being  deleted  w-hen  a  is  true.  If  the  deletable  state  is  not  necessarily  true 
before  the  action  is  executed,  then  it  will  be  deleted  when  both  a  is  true  and  the  action  supporting 
it  precedes  the  deleting  action.  If  the  deletable  state  is  the  goal  or  supports  a  proof  of  the  goal,  then 
in  order  to  prevent  the  goal  from  being  clobbered,  the  deleting  action  must  precede  the  action  sup¬ 
porting  the  deleted  state  when  a  is  true. 

The  procedure  PreventPreconditionClobbering  adds  additional  ordering  constraints  and 
preconditions  to  the  macro-operator  in  order  to  prevent  a  precondition  from  being  clobbered  when 
action  a  deletes  state  d.  If  an  effect  of  action  a  deletes  a  precondition  p  of  action  b.  then  in  order  to 
avoid  a  protection  violation  either  b  must  precede  a  or  c  must  precede  action  c  where  p  is  an  effect 
of  c  [Sacerdoti77.  Sussman73].  However,  the  existing  partial  ordering  of  the  actions  imposed  by  the 
explanation  structure  may  rule  out  either  or  both  of  these  possibilities.  If  both  possibilities  are 
ruled  out.  then  --a  is  added  to  the  list  of  preconditions  for  the  macro-operator  to  prevent  the  plan 
from  being  used  in  such  situations.  Otherwise,  the  possible  ordering  constraints  that  will  prevent 
the  protection  violation  are  recorded. 

As  one  example  of  the  process  of  determining  deletions  and  preventing  protection  violations, 
consider  the  standard  STRIPS  example.  Below  is  the  trace  produced  by  the  system  when  examining 
the  deletions  for  the  generalized  explanation  sho-  /n  in  Figure  6.3. 

Plan  not  valid  when  ?a2«?bl  A  7x9-?xl2  because  ->InRoom(?a2,?x9)  clobbers  lnRoom(?bl.?.\12) 


In  general  -InRoom(?a2.°x9)  deletes  InRoom(?a2.?x9)  when  T 

In  general  InRoom(?a2.?yl2)  deletes  -InRoom(?a2.?x9)  when  ?yl2-7x9 

In  general  InRoom(?bl.?vl2)  deletes  -InRoom('\i2.?x9)  when  ?bl-7a2  A  7yl2»?x9 

In  general  -Ir.Room(?bl  .7x12)  deletes  InRoom(?bl  .?x  12)  when  T 

In  general  -InRoomC’bl  Txl2 )  deletes  InRoom(7a2.?x9)  when  7bl="’a2  A  7xl2*?x9 

In  general  -"InRoom,",bl.7xl2)  deletes  InRoom(?a2.'7x  12)  when  7bl-'1a2 
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In  general  -■InRoom('’a2.?xl2)  deletes  InRoom(°bl  9xl2)  when  9a2='>bl 
In  general  -inReom(0a2.0xl2)  deletes  InRoom(?a2.?x9)  when  7xl2=9x9 
In  general  -’InRoomt ?a2,'i’xl2)  deletes  lnRoom(?a2.?xl2)  when  T 

One  important  thing  to  notice  is  that  -|(',a2=',bl  A  7x9=7x12)  is  added  to  the  list  of  preconditions 
for  the  plan  in  order  to  prevent  the  GoThru  action  from  clobbering  a  precondition  of  the  PushThru. 
Reordering  the  steps  is  ruled  out  since  the  explanation  structure  requires  that  the  GoThru  precede 
the  PushThru  and  since  the  clobbered  precondition  is  a  precondition  of  the  entire  macro-operator 
and  consequently  is  not  achieved  by  an  action  that  can  be  performed  after  the  GoThru.  Actually, 
for  7a2  to  be  equai  to  ?bl  would  require  that  an  agent  be  able  to  push  himself  through  a  door,  and 
for  7x9  to  be  equal  to  7x12  would  require  that  an  agent  be  able  to  go  through  a  door  from  a  room 
back  to  the  same  room.  In  a  better  formulation  of  the  domain,  both  of  these  conditions  could  be 
disallowed  by  adding  additional  preconditions  to  GoThru  and  PushThru. 

6.2.2.2.  Complexity  of  Determining  Deletions  and  Preventing  Protection  Violations 
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The  worst-case  time  complexity  of  the  algorithm  in  Figure  6.6  is  relatively  easy  to  determine. 
In  the  following  discussion,  loops  will  be  referred  to  by  their  iterative  variables  (e.g.  the  a  loop,  the 
e  loon.  etc.). 


Let  the  number  of  nodes  in  the  generalized  explanation  graph  be  n.  Since  the  function  Sup¬ 
ports?  determines  whether  or  not  there  is  a  directed  path  from  one  node  to  another  in  this  graph,  in 
the  worst  case  it  must  traverse  the  entire  graph.  Since  traversing  a  graph  is  OdVj+E)  where  M  is 
the  number  of  nodes  and  £  is  the  number  of  edges  and  since  the  worst  case  complexity  of  a 

call  to  Supports 9  is  0(n:). 

Since  the  number  of  actions  must  be  iess  than  n.  the  outermost  loop  (the  a  loop)  is  executed  at 
most  n  times.  Since  the  number  of  states  is  also  less  than  n.  the  body  of  the  loop  for  determining 
the  set  of  deletable  states  (the  s  loop)  is  executed  at  most  n  times  for  each  action  for  a  maximum 
totai  of  rr  times.  Since  the  body  of  this  loop  includes  a  call  to  Supports 7  whicn  is  O (n~).  the  totai 
maximum  time  needed  for  all  executions  of  the  s  loop  is  0 (n4).  Since  each  deletable  set  :s  a  subset 
of  the  set  of  states,  its  maximum  cardinality  is  n. 


Since  the  totai  number  of  effects  of  actions  must  be  less  than  the  number  of  states,  the  body 
cf  the  e  loop  is  executed  at  most  a  total  of  n  times.  Since  n.  is  the  maximum  size  of  a  deletable  set. 
’"e  body  cf  the  J  loop  is  executed  at  most  n.  times  for  each  effect  for  a  maximum  total  of  n:  times. 
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in  addition  to  the  call  to  PreventPreconditionClobbering.  tne  operations  in  the  body  cf  the  d  loop 
that  depend  on  the  size  of  r.  include  a  call  to  Supports ?  and  determining  whether  a  state  supports  a 
proof  cf  the  goal.  Since  both  of  these  operations  involve  finding  a  path  between  two  nodes  m  the 
explanation  graph,  they  take  at  most  0 (n2)  time.  Therefore,  let  the  total  maximum  time  needed 
for  all  executions  of  the  e  loop  be: 

n2(0(n2)  +  P) 

where  P  is  the  time  complexity  of  a  call  to  PreventPreconditionClobbering. 

The  body  of  the  b  loop  in  PreventPreconditionClobbering  is  executed  less  than  n  times  since 
the  total  number  of  actions  is  less  than  n.  Since  executing  the  body  of  the  loop  requires  two  calls  to 
Supports?  w'hich  takes  0(n2)  time,  the  maximum  time  required  for  a  call  to  this  function  is  0(n3). 
Therefore,  the  total  maximum  time  needed  for  all  executions  of  the  e  loop  is: 

r.2(0(n2)  +  0(n3))  =  0(n3) 

Since,  the  total  maximum  time  needed  for  all  executions  of  the  s  loop  was  calculated  to  be  only 
Olr.4).  the  worst  case  time  complexity  of  the  complete  process  is  also  0(n5). 
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The  fact  that  the  O (n2)  Supports ?  function  is  called  within  bodies  of  loops  that  are  executed 
0(n:)  ar.d  0(n3)  times  is  a  dominant  factor  in  the  complexity  of  the  complete  algorithm.  If  a  call 
to  this  function  could  be  reduced  to  constant  time,  a  minor  modification  of  the  preceding  analysis 
demonstrates  that  the  complexity  of  the  complete  process  would  then  be  0(n3).  The  maximum  time 
required  for  all  executions  of  the  s  loop  becomes  O (n2).  The  total  maximum  lime  for  all  executions 
of  the  e  loop  becomes: 

n2(c  +  P) 

where  c  is  a  constant.  The  time  complexity  of  a  call  to  PreventPreconditionClobbering  ( P )  becomes 
0 (n)  and  the  overall  worst  case  complexity  becomes: 

0(n:)  *r  n2(c  +  O(n))  =  0(n3) 

Since  Suppor:s?la.b )  simply  determines  whether  or  not  there  is  a  directed  path  from  a  to  b,  the 
value  of  this  function  for  ail  pairs  of  the  n  nodes  in  the  explanation  graph  can  be  determined  by 
computing  the  graph's  transitive  closure.1  Since  there  exist  OGVr5)  algorithms  for  computing  the 
transitive  closure  of  a  directed  graph  [Reingold77],  all  the  values  of  Support?  for  a  particular  expla¬ 
nation  can  be  pre-computed  in  0 (n3)  time.  Calls  to  this  function  wdll  then  take  constant  time  and 
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the  complete  process  of  determining  deletions  zr.d  protection  violations  can  be  accomplished  m 
CK  n3'!  time. 

In  conclusion,  although  the  algorithm  implemented  in  EGGS  is  0(/z3).  computation  of  the  the 
transitive  closure  of  the  generalized  explanation  graph  using  one  of  the  known  0(|Vp)  algorithms 
would  result  in  an  0(n3)  algorithm  for  the  problem  of  determining  deletions  and  preventing  pro¬ 
tection  violations. 

6.2.2.3.  Determining  the  Effects  of  a  Macro-Operator 

Once  all  possible  deletions  have  been  marked,  determining  the  effects  of  the  over-all  macro¬ 
operator  is  quite  simple.  The  effects  of  each  individual  action  are  considered  in  turn.  If  an  effect 
was  never  marked  as  being  deleted  under  any  condition,  then  it  is  added  to  the  list  of  effects  for  the 
macro-operator.  In  the  STRIPS  example,  this  accounts  for  the  first  four  effects  shown  in  Table  6.2. 
If  a  state  was  marked  as  being  deleted  under  some  set  of  conditions,  then  the  overall  effect  is  given 
by  the  following  implication:  if  none  of  the  deletion  conditions  are  met.  then  the  state  is  true.  In 
the  STRIPS  example,  the  state  -InRoom(?a2.  ^ x9 )  was  marked  as  deleted  if  and  only  if  ,,yl2=?x9. 
Consequently,  the  final  effect  of  the  macro-operator  is:  -'(?vl2*?x9)  —  -'InRoom(7a2.  7x9).  This 
simply  states  that  unless  the  agent  pushes  the  object  back  to  the  room  he  originally  started  in  (as  in 
the  observed  instance),  then  he  is  no  longer  in  the  starting  room. 

6.2-2.4.  Detecting  Inconsistent  Ordering  Constraints 

One  of  the  final  steps  in  building  a  macro-operator  is  the  detection  and  prevention  of  incon¬ 
sistent  ordering  constraints.  The  ordering  constraints  posted  to  prevent  protection  violations  may 
contradict  each  other  in  certain  situations.  For  example,  one  constraint  may  state  that  action  A 
must  proceed  action  B  when  a  is  true  while  another  states  that  B  must  proceed  A  when  3  is  true.  In 
order  to  detect  such  situations,  a  resolution  theorem  prover  is  used  to  determine  sets  of  ordering 
constraints  that  are  inconsistent.3  In  addition  to  the  posted  ordering  constraints,  the  theorem  prover 
is  given  the  following  two  axioms: 

Beforefl'a.l’b)  ^  Before(?b.?c)  -•  Before(?a.?c) 

BeforeC’a.l’b)  —  -Before(?b.?a) 

if  and  oniy  :f  there  :s  a  non-empty  /  directed  path  from  u  to  v  in  E  [Even79j. 

3  Because  of  disjunctive  ordering  constraints,  a  simpie  cycle -detecting  algorithm  cannot  be  usee  to  and  al'.  sets  of  incon¬ 
sistent  constraints. 
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Wner.  the  resolution  theorem  prover  finds  a  contradiction,  it  returns  the  set  of  axioms  upon  which 
the  proo:  depends.  In  order  to  prevent  such  a  contradiction  from  arising  during  the  subsequent  use 
of  the  macro-operator,  an  additional  precondition  must  be  added.  Assuming  each  of  the  n  ordering 
constraints  ci  in  the  contradictory  set  must  be  satisfied  when  condition  a  is  true,  the  following 
precondition  must  be  added  to  the  macro-operator:  -'(oq  A  a,  ...  A  q  )  In  the  example  above,  the  wff: 
-(a  A  /3)  would  be  the  necessary  additional  precondition.  A  more  concrete  example  is  given  in  sec¬ 
tion  6.3. 1 . 

Of  course,  using  a  resolution  theorem  prover  to  find  all  sets  of  contradictory  axioms  is  a  com¬ 
putationally  intractable  process  which  is  not  even  guaranteed  to  halt.  In  EGGS,  the  theorem  prover 
is  given  a  time  limit,  after  which  it  stops  and  returns  proofs  for  all  the  contradictions  it  has  found 
sc  far.  Consequently,  the  system  does  not  actually  guarantee  the  prevention  of  ordering  contradic¬ 
tions  during  iater  instantiation.  Should  such  a  contradiction  arise,  one  could  perform  explanation- 
based  learning  from  failure  to  modify  the  plan  at  that  time  in  order  to  prevent  similar  problems  in 
the  future.  Some  actual  systems  that  use  EBL  to  learn  from  failures  due  to  unforeseen  interactions 
are  described  in  [GuptaS7]  and  [ChienS7a]. 

6.2.2.5.  Simplifying  the  Results 

The  final  set  of  wffs  describing  the  preconditions,  effects,  and  ordering  constraints  for  a 
macro-operator  can  be  unnecessarily  complex.  Consequently.  EGGS  uses  a  system  for  simplifying 
logical  expressions  to  clean  up  the  final  results.  As  described  in  [MintonS7a],  simplification  or 
compression  heips  to  decrease  the  matching  cost  needed  to  determine  whether  or  not  a  macro- 
operator  applies. 

The  simplifier  currently  used  by  EGGS  first  converts  expressions  to  conjunctive  normal  form 
and  eliminates  tautological  and  subsumed  clauses.  Next,  a  set  of  axioms  supplied  by  the  user  for 
each  domain  are  used  to  further  simplify  the  CNF  formula.  If  a  literal  l  in  a  clause  together  with 
the  domain  axioms  and  other  clauses  implies  another  literal  in  the  clause,  then  the  literal  L  is 
removed.  The  EGGS  deductive  retriever  is  used  to  prove  the  implication.  This  is  a  valid 
simplification  since: 

((PAR)-Q)  -  ((R  A  ( PVQ ) )  =  (R  A  Q)) 

Therefore,  if  (P'R)— *Q  is  true  in  the  domain,  then  the  reduced  formula  is  equivalent  to  the  origi¬ 
nal.  Clauses  that  are  implied  by  other  clauses  together  with  the  domain  axioms  are  also  removed. 
Once  again,  the  EGGS  deductive  retriever  is  used  to  prove  the  implication.  This  simplification  is 


.  a.:c  since. 


(P— Q)  -  (CP' 0)  =  P) 

Therefore,  if  P— 'Q  is  true  in  the  domain,  then  the  reduced  formula  is  equivalent  to  the  original. 
Finally.  DeMorgan's  law  is  used  to  convert  clauses  that  are  more  compactly  represented  as  negated 


terms.  That  is.  clauses  of  the  form: 


are  changed  to: 


■’ZjV-Z,  ... 


-.(<  a/  n  a i  ) 

A  concrete  example  of  the  amount  of  simplification  possible  using  this  method  is  given  in  sec¬ 
tion  6.3.1.  Of  course,  the  EGGS  simplifier  is  not  guaranteed  tc  produce  optimal  simplifications: 
however,  general  problems  of  minimizing  logical  expressions  are  known  to  be  XP-Complete  (see 
Minimum  Axiom  Set  and  Minimum  Disjunctive  Normal  Form  in  [Garey79]). 

6.2.2.6.  Relation  to  Nonlinear  Planning 

Work  in  nonlinear  planning  [Sacerdoti77.  Tate76‘]  has  addressed  the  issue  of  building  plans 
wi :h  partially  ordered  actions  by  initially  assuming  there  are  no  goal  interactions  and  then  detect¬ 
ing  protection  violations  and  imposing  ordering  constraints  to  prevent  them.  Although  the  task  of 
building  a  specific  partially-ordered  plan  for  achieving  a  specific  goal  is  quite  different  from  the 
task  of  generalizing  a  specific  totally-ordered  plan  into  a  general  partially-ordered  macro-operator, 
some  of  the  underlying  processes  are  quite  similar. 

In  [NilssonSO],  the  procedure  for  detecting  and  preventing  protection  violations  in  nonlinear 
planning  is  called  DCOMP.  The  procedure  in  Figure  6.6  is  in  some  ways  more  general  and  in  some 
wavs  less  general  than  DCOMP.  It  is  more  general  because  it  must  determine  the  conditions  (a) 
under  which  a  deletion  will  take  place  in  the  generalized  plan.  DCOMP  works  with  a  specific 
instance  of  a  plan  and  therefore  does  not  need  to  determine  which  instantiations  of  a  plan  result  in 
a  deletion.  The  procedure  in  Figure  *6.6  is  less  general  than  DCOMP  because  it  does  not  consider 
alternative  wavs  of  achieving  the  preconditions  of  an  action.  The  explanation  structure  explanation 
determines  how  preconditions  were  met  in  the  particular  example,  and  these  constraints  a*e 
retained  in  the  generalization.  Using  the  terminology  from  [NilssonSO].  the  procedure  r-esen-en 
here  does  not  determine  all  of  the  possible  adders  of  a  precondition  in  the  plan  since  eacr.  r-*. 
tion  is  assumed  to  added  by  the  action  that  added  it  in  the  originai  specific  instance  <  >: 
even  more  general  macro-operator  couid  be  produced  by  generalizing  the  explanar.-T 
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considering  alternative  adders.  However,  this  approach  would  ever,  further  complicate  the  pro¬ 
cedure  for  generating  a  macro-operator. 

The  actual  process  of  determining  what  Nilsson  calls  a  noninteractive  ordering  for  a  particular 
instantiation  of  a  plan  must  be  done  when  the  final  macro-operator  is  used  to  solve  a  future  prob¬ 
lem.  Assuming  all  inconsistent  sets  of  ordering  constraints  were  detected  and  prevented  by  adding 
additional  preconditions  to  the  macro-operator,  for  any  situation  that  meets  the  preconditions, 
there  is  guaranteed  to  be  a  partial  ordering  of  the  actions  that  satisfies  all  of  the  ordering  con¬ 
straints.  However,  due  to  the  possible  presence  of  disjunctive  ordering  constraints,  actually  finding 
it  may  require  an  expensive  search.  However,  this  should  not  be  surprising  since  the  increased  gen¬ 
erality  of  a  partially-ordered  macro-operator  can  clearly  result  in  a  corresponding  decrease  in 
operationaiity. 

6.3.  Plan  Examples 

Unfortunately,  the  STRIPS  example  does  not  illustrate  many  of  the  aspects  involved  in  con¬ 
structing  macro-operators  with  partially  ordered  actions  because  the  explanation  structure  itself 
imposes  a  total  ordering  on  the  two  actions  in  the  plan.  This  section  presents  an  additional  example 
involving  program  assignment  statements  in  which  the  actions  are  not  totally  ordered.  In  addition 
to  STRIPS  and  variable  assignment  examples  like  the  ones  presented  in  this  chapter,  the  system  has 
also  been  tested  on  classical  biock  stacking  examples  (like  those  in  [Sussman73])  and  narrative 
understanding  examples  (see  chapters  9-11  and  appendix  B.  in  particular  the  Arson  example  in  sec¬ 
tion  B.2  contains  an  interesting  potential  protection  violation). 

6.3.1.  Variable  Assignment  Example 

An  action  definition  for  assigning  a  value  to  a  variable  modelled  after  LISP’s  SETQ  function  is 
shown  in  Tabie  6.3.  Seiq(?a.?b)  sets  variable  ?a  to  the  value  of  variable  ?b  and  Yalue(?a.?x)  means 
that  variable  '’a  has  value  ?x.  To  make  the  problem  interesting,  it  is  assumed  that  we  are  modelling 
parallel  computation  in  which  any  number  of  Setq  s  can  be  performed  simultaneously.  As  a  result, 
determining  possible  protection  violations  in  this  domain  is  equivalent  to  detecting  interactions 
between  parallel  computations:  and  imposing  ordering  constraints  is  equivalent  to  forcing  certain 
statements  to  be  executed  sequentially. 

Consider  the  following  simple  problem  and  the  plan  for  solving  it: 

Given:  Value(A.B).  Value(X.Y).  Vaiue(R.N).  Value(S.M) 
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Table  6.3:  Variable  Assignment  Actions 


Action 

Preconditions  1  Effects 

SetqC,a.?b) 

Value(?a.?x) 

Value(?b.?v) 

Value(?a.?y) 

->Value(?a.?x) 

Goal:  Yalue(A.X)  A  Value(X.M) 

Plan:  Setq(A.R).  Setq(X.S) 

The  specific  and  generalized  explanations  constructed  by  EGGS  while  verifying  this  plan  are  shown 
in  Figure  6.7  and  Figure  6.S  respectively.  Notice  that  the  explanation  structure  does  not  impose  an 
ordering  on  the  two  Seta  operators.  A  trace  of  the  process  for  determining  deletions  and  preventing 
protection  violations  for  this  example  is  given  below: 

In  general  Value(?a2.?y2)  deletes  ->Value(?al.?xl)  when  (?a2=*?al  A  ?y2=?xl)  and  Setq(?al.?bl) 
precedes  Seta(?a2.?b2) 

In  general  -<Value(?a2.?x2)  deletes  Value(?b2.?y2)  when  (?a2=?b2  A  ?x2«?y2) 

In  general  -Value(?a2.?x2)  deletes  Value(?a2.?x2)  when  T 

Setq(?al,?bl)  should  precede  Setq(?a2.?b2)  when  (?a2=*?bl  A  ?x2=?vl)  or  else  -•Value(?a2,?x2) 
will  clobber  Value(?bl.?yl) 

In  general  ->Value(?a2.?x2)  deletes  Value(?bl.?yl)  when  (?a2«?bl  A  ?x2«?yl) 

Setq(?al.?bl)  should  precede  Setq(?a2,?b2)  when  (?a2=7al  A  ?x2=?xl)  or  else  ->Value(?a2.?x2) 
will  clobber  Value(?al.?xl) 


(Value(A.N)  A  Value(X.M)) 


Value(X.M)  -Value(X.Y)  Value(A.N) 


-Value(A.B) 


Setq(X.S) 


Setq(  A.R) 


Value(S.M)  Value(X.Y)  Value(R.N)  Yalue(A.B) 

Figure  6.7:  Variable  Assignment  Example  —  Specific  Explanation 


(Value(?al.?yl)  A  Value(?a2.?v2)) 


Valuc(?b2.?y2)  Value!  ?a2.?x2)  Value(?bl.?yl)  Value(?al.?xl) 

Figure  6.8:  Variable  Assignment  Example  —  Generalized  Explanation 


In  general  *'Value(?a2.?x2)  deletes  Value(?al.?xl)  when  (?a2~?al  A  ?x2=?xl) 

Setq(?a2.?b2)  should  precede  Setq(?al.?bl)  when  C?a2=«?al  A  ?x2-?yl)  or  else  -Value(7a2.?x2) 
will  clobber  goal:  Value(?al.?yl) 

In  general  Value(?al.?yl)  deletes  -Value(?a2.?x2)  when  (?al~?a2  A  ?yl-?x2)  and  Seta(7a2.?b2) 
precedes  Setq(?al.?bl) 

Setq(?a2.?b2)  should  precede  Setq(7al  ,?bl)  when  (7al“?b2  A  ?xl«?v2)  or  else 
-,Value(?al.?xl)  will  clobber  Value(?b2.?y2) 

In  general  -,Value(?al.?xl)  deletes  Value(?b2.?v2)  when  (?al»?b2  A  ?xl»?y2) 

Setq(?a2.?b2)  should  precede  Setq(?al.?bl)  when  (?al-?a2  A  ?xl-?x2)  or  else  ->Value(?al.?xl) 
will  clobber  Value(?a2.?x2) 

In  general  -'Value(?al.?xl)  deletes  Value(?a2.?x2)  when  (?al«?a2  A  ,,xl«*?x2) 

Setq(?al.?bl)  should  precede  Setq(7a2.?b2)  when  (?al«7a2  A  ?xl«?y2)  or  else  ->V'alue(?al.?xl) 
will  clobber  goal:  Value(?a2.7v2) 

In  general  ->Vaiue(?al.?xl)  deietes  Value(?bl ,7v  1 )  when  (?al*?bl  A?xl»?vl) 

In  general  -Value! 7al.?xl)  deietes  Value! ?al.?xl)  when  T 

I  Before’,  Seiq!7a2.°b2).Setq(?al.'1bl))  A  Before! Setq(?al.?bl).Setq(?a2.7b2))) 
results  in  a  contradiction.  Add  precondition:  -(((7a2»?al  A  ?x2«?vl)  V  (7al-?b2  A  ,’xl-7y2)  V 
!',al-"’a2  A  ?xl-?x2))  A((?a2-?bl  A  ?x2-?yl)  V  (?a2-7al  A  ?*2-?xl) V  (7al-^a2  A  7xl-7v2))) 
which  simplifies  to  -!?al«!*b2  A?a2»°bl)A  -!7a2-7al) 


>11] 


The  system  detects  that  each  Seta  could  potentially  clobber  a  precondition  of  the  other  one  if  cer¬ 
tain  variables  are  the  same.  Ordering  constraints  are  posted  to  prevent  protection  violations  in  these 
situations.  However,  these  ordering  constraints  contradict  each  other  in  certain  situations.  For 
example,  when  ?al=?a2  and  ?xl=?x2  these  constraints  specify  that  each  of  the  two  Setqs  should 
precede  the  other  since  in  this  situation  each  one  clobbers  a  precondition  of  the  other.  The  process 
described  in  section  6.2.2. 4  detects  such  possible  contradictions  and  adds  preconditions  to  prevent 
them.  In  the  example,  this  process  results  in  both  ->(?a2=?al)  and  -(',al=*9b2  A  9a2«9bl)  being 
added  as  preconditions  to  the  macro-operator.  When  simplifying  the  final  results,  the  system  made 
use  of  the  following  domain  axiom:  Value(?x.?a)  A  Value(?y.?b)  A  9x=9y  —  9a=?b.  This  axiom  sim¬ 
ply  states  that  a  variable  can  have  only  one  value  at  a  time.  Table  6.4  shows  the  macro-operator 
definition  resulting  from  the  complete  packaging  process. 


In  summary,  the  system  notices  that  if  the  two  variables  being  set  C*al  and  ?a2)  are  the  same, 
then  the  goal  of  having  them  set  to  two  different  values  cannot  be  achieved.  Also,  if  either  assign¬ 
ment  references  the  variable  set  by  the  other,  then  the  two  cannot  be  executed  in  parallel.  In  these 
cases,  the  assignments  must  be  properly  ordered  to  prevent  the  referenced  variable  from  being  reset 
before  it  is  used.  Finally,  if  each  assignment  references  the  variable  sec  by  the  ocher,  then  two 


assignments  cannot  solve  the  problem.  This  is  the  classic  "variable  swapping”  problem  which 
requires  the  use  of  a  temporary  variable  (see  section  7.5.3  of  [N’ilssonSO].). 

! _ Table  6.4:  Macro-Operator  Learned  from  the  Variable  Assignment  Example _ ! 

SetqSetq(?al.?bl.?x2.?v2.?xl.?yl.?a2.?b2) 
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CHAPTER  7 

EGGS:  EMPIRICAL  RESULTS  ON  THE  EFFECT  OF  LEARNING 


This  chapter  summarizes  some  empirical  results  on  the  effect  of  learning  on  problem  solving 
performance  in  all  of  the  domains  discussed  in  chapters  3.  4  and  51.  The  macro-rules  learned  by 
EGGS  affect  its  future  performance  in  various  ways.  After  learning  a  macro-rule  from  a  particular 
example,  performance  on  similar  problems  is  enhanced.  However,  performance  may  be  degraded  on 
problems  that  are  superficially  similar  but  can  not  be  solved  using  the  macro-rule. 


7.1.  Performance  Improvement 

Table  7.1  summarizes  empirical  results  on  positive  transfer,  the  ability  of  learning  to  improve 
problem  solving  performance  on  similar  problems.  It  gives  the  CPU  time  in  seconds  required  to 
solve  a  problem  both  before  and  after  learning  a  macro-rule  capable  of  solving  the  problem  in  one 
step.  In  each  of  the  domains,  the  learned  rule  is  the  one  discussed  in  the  associated  section  in 
chapter  3.  4.  or  5.  The  CPU  time  required  to  learn  the  rule  is  also  given,  where  learning  includes 
pruning  and  generalizing  the  explanation  as  well  as  packaging  it  into  a  macro-rule.  Performance 
was  tested  on  solving  the  test  problem  discussed  for  each  domain  before  and  after  learning  the  rule 


Table  7.1;  Speedup  Due  to  Learning 


Example 

Learning 

CPU  sec 

Solution  CPU  sec 

Speedup 

Before 

After 

CUP 

1.08 

4.52 

3.70 

1.30 

Suicide 

.76 

1.69 

1.3S 

1.22 

SafeToStack 

.81 

3.69 

2.72 

1.36 

LEAP 

.73 

3.99 

.79 

5.05 

MA 

.67 

759.00 

.53 

1432.00 

LEX2 

.76 

2.02 

1.31 

1.54 

Geometry 

1.26 

5.71 

1.53 

3.73 

Geometrv  (difficult) 

5.52 

>  1650.00 

4.02 

>415.00 

STRIPS  (deductive) 

6.11 

14.50 

3.94 

3.68 

LEAP  (rewrite) 

.35 

1.10 

.49 

2.24 

LEX2  (rewrite) 

.46 

1.60 

1.09 

1.47 

Equation  Solving 

8.08 

21.40 

S.7S 

2.44 

’“Examples  :n  cr.apter  6  ire  not  considered  since  EGGS  does  not  have  a  planner  for  mcependently  solving  problems  us¬ 
ing  STRIPS  operators. 
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from  the  different  but  related  learning  example  for  that  domain. :  In  all  of  the  theorem-proving 
examples,  the  backward-chaining  depth  bound  was  set  to  ten.  the  depth  of  the  shortest  known 
proof  for  the  hardest  problem  Uhe  ‘'difficult  geometry"  example). 

The  speedup  is  the  ratio  of  the  two  run  times  and  represents  how  much  faster  the  system 
solves  the  test  problem  after  learning.  In  the  simple  examples  where  little  or  no  search  is  involved, 
the  speedup  is  oniv  about  1.3:  however,  in  the  more  realistic  examples  with  larger  search  spaces, 
the  speedup  ranges  from  about  4  to  1400  times  faster.  Since  the  difficult  geometry  problem  shown 
Figure  4.17  could  not  even  be  solved  before  learning,  the  speedup  due  to  learning  from  a  sketchy 
p:oof  can  only  be  given  a  lower  bound  of  about  400. 

Empirical  results  on  positive  transfer  resulting  from  explanation-based  learning  have  also 
been  reported  for  the  PROLEARN  learning  PROLOG  system  [PrieditisS7j.  However,  only  one  of  the 
PROLEARN  examples  involves  any  search,  and  this  is  the  simple  SafeToStack  example  which 
requires  minimal  search.  Therefore,  no  large  speedups.  like  those  reported  here  for  the  MA  and 
difficult  Geometry  examples,  are  presented  in  the  PROLEARN  results.  More  comprehensive  experi¬ 
ments  m  the  single  domain  of  proving  theorems  in  elementary  logic  (the  MA  domain)  are  reported 
in  [O'RorkeSTa.  0'RorkeS7b].  These  experiments  demonstrated  that  explanation-based  learning  can 
improve  performance  to  a  greater  degree  than  rote  learning.  On  a  set  of  52  theorems  from  Princi¬ 
ple  Mathemctica  [\Vhiteheadl3].  an  EBL  system  using  depth-bounded  search  proved  83%  of  the 
theorems,  compared  to  79%  for  a  rote-learning  system,  and  31%  for  a  non-learning  system  (all  sys¬ 
tems  were  given  the  same  depth-bound).  Over  all  the  problems,  the  EBL  system  searched  32%  less 
nodes  than  the  non-learning  system,  while  the  rote  learning  system  searched  20%  more  nodes  than 
the  non-learning  system.3  Some  recent  experiments  that  compare  a  non-learning  system,  a  standard 
EBL  system,  and  an  EBL  system  that  can  learn  iterative  macro-operators  are  reported  in 
[ShavlikSS], 

Ir.  order  for  the  learning  of  macro-rules  to  contribute  to  the  overall  performance  of  a  system, 
there  must  be  regularity  in  the  class  of  problems  to  which  the  system  is  applied.  It  must  be 
assumed  that  it  is  likely  that  the  system  will  encounter  a  problem  that  is  similar  to  some  past 
problem  from  which  it  has  already  acquired  a  macro-rule.  Fortunately,  this  assumption  is  prob¬ 
ably  not  an  unreasonable  one  since  most  domains  are  characterized  by  classes  of  problems  that  can 

;MoriiSea  versions  o:  the  learning  examples  were  used  to  test  the  Cup  and  Suicide  examples  presented  :n  cnaptcr  3. 

These  percentages  were  calculated  from  data  presented  ir.  [ORorlceS'b.  Table  3.1] 


be  solved  using  similar  techniques.  The  advantage  of  learning  from  examples  taken  from  the 
environment  instead  of  randomly  generating  macro-rules  for  various  combinations  of  rules  is  that 
the  examples  will  hopefully  characterize  the  types  of  problems  the  system  will  have  to  solve  in  the 
future. 

7.2.  Performance  Degradation 

In  addition  to  performance  gains,  there  is  also  a  price  to  pay  for  learning  macro-rules.  When 
the  system  attempts  to  solve  a  problem  that  cannot  be  solved  using  the  macro-rule,  it  wastes  time 
unsuccessfully  trying  to  apply  the  rule.  The  result  is  negative  transfer,  a  decrease  in  problem  solv¬ 
ing  performance  on  superficially  similar  problems  after  learning.  Table  7.2  summarizes  some 
empirical  data  that  illustrates  negative  transfer.  It  gives  the  CPU  time  in  seconds  required  to  solve 
a  problem  before  and  after  learning  a  rule  that  cannot  be  used  to  solve  the  problem.  Once  again, 
theorem-proving  examples  were  run  with  a  depth-bound  of  ten. 

Slowdown  is  the  ratio  of  the  two  run  times,  which  represents  how  much  faster  the  system 
solved  the  problem  before  learning.  The  test  problems  were  constructed  with  the  goal  of  maximiz¬ 
ing  this  slowdown  effect.  This  was  done  by  attempting  to  construct  goals  that  match  the  conclu¬ 
sion  of  the  macro-rule,  thereby  causing  the  system  to  try  (unsuccessfully)  to  use  the  rule  to 
achieve  the  goal.  Unfortunately,  this  was  only  possible  for  the  SafeToStack.  LEX2.  Geometry,  and 
STRIPS  examples  since  the  macro-rules  for  the  LEAP  and  MA  examples  do  not  have  antecedents 
and  equation  solving  examples  that  met  the  criteria  could  not  be  solved  without  adding  a  large 


Table  7.2:  Slowdown  Due  to  Learning 


Example 
;  SafeToStack 

;  LEAP 
MA 
.  LEX2 
Geometry 

:  STRIPS  (deductive) 

LEAP  (rewrite) 
LEX2  ( rewrite) 
Eauation  Solving 


Test  Problem 

SafeToStack(Objl.Obj2) 

(Given  -Fragile(Obj2)) 
Equiv(-(-av'b).?x) 

ML  =>  P-(PVQ) 

Solution! /3x_Idx.?a) 

AABD=ACBD _ _ _ 

(Given  AB=CB.  AD=CD) 
Achieve(inroom(  Robot. Room2).?s) 
(Given  facts  for  Figure  3.6) 
-'(-aVb) 

/  3x'‘dx 

ln(x*3)-ln(5)-c 


!  Solution  CPU  sec 
I  Before  \  After 


Slowdown 


1  Before  I  After  _ 

1.06  |  L09  LOT 

i 

1.49  1.50  1.01 

j  21.10  21.90  1.04 

;  2.19  2.39  1.09 

I  4.63  S.1S  1.77 

j  I 

3.62  !  >2100.00  >580.00 

I 

.64  |  .69  1.0S 

1 .28  j  1.41  1.10 

11.40  i  11.70  1.03 


m 


rC 


number  of  extra  rules.  In  an  attempt  to  maximize  slowdown  in  tnese  remaining  cases,  problems 
were  constructed  which  match  the  conclusion  of  the  macro-rule  as  much  as  possible. 

In  these  experiments,  the  slowdown  effect  of  learning  a  particular  rule  was  generally  much 
less  than  the  speedup,  so  the  gain  of  learning  the  macro-rules  probably  out-weighs  the  loss.  In  the 
STRIPS  example,  however,  learning  causes  the  system  to  waste  a  great  deai  of  time  attempting  to 
use  the  macro-rule  on  a  relatively  simple  problem  that  is  much  more  easily  solved  from  first  prin¬ 
ciples.  The  system  eventually  exhausts  its  storage  before  it  abandons  its  attempt  to  use  the  new 
macro-rule  to  solve  the  problem.  [MmionSf]  describes  a  more  comprehensive  experiment  in  a  sin¬ 
gle  domain  very  much  like  STRIPS'  domain  which  demonstrates  that  indiscriminately  learning 
macro-operators  can  eventually  lead  to  the  slowdown  effect  dominating  and  result  in  an  overall 
decrease  in  performance. 

In  these  cases,  it  is  important  to  be  selective  about  which  macro-rules  are  created  and  which 
are  retained.  The  problem  of  determining  whether  a  macro-rule's  positive  contribution  to  perfor¬ 
mance  outweighs  its  negative  effect  has  been  called  the  utility  problem  [\lintor.S7a],  The  possibility 
of  generating  large  numbers  of  macro-operators  which  adversely  affect  performance  was  first  men¬ 
tioned  in  the  conclusion  of  the  STRIPS  learning  paper  [Fikes72j.  Fikes  et.  cl  proposed  that  a  possi¬ 
ble  solution  to  this  problem  might  be  the  inclusion  of  a  mechanism  for  forgetting  infrequently  used 
macro-operators.  This  mechanism  would  keep  statistics  on  the  frequency  with  which  each  macro- 
operator  is  used  ar.d  discard  those  which  fell  below  a  certain  threshold.  The  learning  of  S-Macros 
in  MORRIS  [Mintor.55]  is  an  implementation  of  this  proposal.  Experiments  conducted  with 
MORRIS  indicate  that  this  technique  can  result  in  improved  performance  [Minton85].  Analytical 
methods  for  judging  the  utility  of  macro-operators  (i.e.  their  ability  to  improve  performance)  have 
aiso  been  developed  and  tested  [Iba85.  Minton85.  \lintonS7a].  If  macro-operators  that  are  known 
to  prevent  backtracking  (T- Macros)  are  selectively  retained,  empirical  results  indicate  th.v  overall 
performance  can  be  improved  [MintonS5].  Some  additional  untested  suggestions  for  dealing  with 
the  problem  of  slowdown  are  given  in  [PrieditisS 7 ].  Unlike  STRIPS  or  the  current  PROLEARN  sys¬ 
tem.  GEXESIS  does  not  learn  from  every  explanation.  The  criteria  GENESIS  uses  to  determine 
when  an  explanation  is  worth  learning  from  are  discussed  in  chapter  11. 

7.3.  A  Suggestion  for  Better  Controlled  Experiments 

Macro-rules  can  improve  a  resource-bounded  problem  solving  system  in  two  important  ways. 
It  can  increase  the  class  of  problems  the  system  is  inherently  capable  of  solving  and  it  can  increase 
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ihe  speed  of  problem  solving  A  good  experimental  comparison  of  the  performance  cf  a  learning 
and  non-learning  system  must  consider  both  of  these  factors.  An  experiment  that  examines  only 
one  of  these  factors  without  controlling  for  the  ocher  can  produce  misleading  results. 

Specifically,  when  the  problem  solver  is  depth-first  and  depth-bounded,  like  the  theorem 
prover  in  EGGS  and  in  [0'RorkeS7b].  performance  data  is  actually  biased  against  a  system  that 
learns  macro-rules.  Such  a  system  can  waste  a  great  deal  of  time  trying  to  use  a  macro-rule  to 
sole e  a  problem  by  combining  it  with  other  macro-rules  and  with  rules  in  the  original  domain 
theory-.  With  the  same  depth-bound  for  both  the  learning  and  non-learning  systems,  a  system  that 
learns  macro-rules  is  capable  of  solving  a  larger  set  of  problems.  This  is  because  a  macro-rule  that 
can  represent  an  arbitrarily  deep  proof  in  the  original  domain  theory  counts  as  only  one  rule  appli¬ 
cation  when  used  to  solve  new  problems.  Therefore,  the  search  space  the  learning  system  must 
explore  is  constantly  getting  larger  since  learned  rules  increase  the  branching  factor  while  the  depth 
of  its  search  tree  remains  constant.  On  the  other  hand,  the  search  space  the  nor.-iearning  system 
must  explore  remains  constant. 

For  example,  in  the  STRIPS  example,  since  the  learned  macro-rule  has  depth  7.  in  some  places 
the  learning  system  is  exploring  the  search  space  of  the  original  domain  theory  to  a  depth  of  16 
instead  of  10  when  using  this  rule.  This  in  turn  accounts  for  the  large  amount  of  time  spent  trying 
to  use  this  particuiar  macro-rule  on  the  slowdown-testing  problem  for  the  STRIPS  domain. 

In  summary,  in  order  for  the  overall  execution  time  of  the  learning  system  to  be  less  than  the 
non-learning  system's,  it  must  be  capable  of  both  solving  more  problems  and  solving  them  faster. 
A  fairer  comparison  of  overall  search  or  execution  time  would  require  a  controlled  experiment  in 
which  the  class  of  problems  solvable  by  both  systems  remained  constant.  This  could  be  done  by 
marking  the  depth  of  each  antecedent  of  a  macro-rule  in  terms  of  the  rules  in  the  original  domain 
theory  During  problem  solving,  each  antecedent  would  only  be  pursued  until  it  has  reached  the 
depth-bound  in  terms  of  rules  in  the  original  domain  theory.  This  would  prevent  large  amounts  cf 
time  oeing  consumed  trying  to  use  macro-rules  and  result  in  a  fairer  comparison  of  the  perfor¬ 
mance  of  the  learning  and  non-learning  systems. 


CHAPTER  8 


GENESIS:  SYSTEM  OVERVIEW 


This  chapter  concerns  the  organization  of  the  complete  GENESIS  narrative  processing  system. 
GENESIS  is  built  on  top  of  the  EGGS  explanation-based  learning  system  and  enhances  its  ability  to 
deal  with  plan-based  explanations.  The  narrative  understanding  process  in  GENESIS  is  used  to 
construct  explanations  for  character's  intentional  behavior.  EGGS  then  generalizes  these  explana¬ 
tions  to  produce  plan  schemata  which  GENESIS  can  then  use  to  improve  its  understanding  ability. 
Figure  S.l  illustrates  the  overall  architecture  of  the  system.  The  architecture  of  GENESIS  is  a 
straight-forward  instantiation  of  the  prototypical  explanation-based  learning  system  (PEBLS)  as 
defined  in  [SegreS7aj  in  which  the  performance  element  is  the  same  as  the  understander.  This 
chapter  briefly  summarizes  the  function  of  each  of  the  components  shown  in  Figure  8.1.  The 
schema  library,  understander,  causal  model,  schema  Learner,  word  learner,  and  indexer retriever  are 
discussed  in  detail  in  subsequent  chapters. 

8.1.  The  Parser  and  Lexicon 

An  adaptation  of  McDYPAR  [DverS3]  (a  micro-version  [SchankSlb]  of  the  parser  used  in  the 
BORIS  system)  is  used  to  parse  English  sentences  into  predicate  calculus  wffs.  The  parser  makes 
uses  of  word  definitions  stored  in  the  lexicon.  Numerous  examples  of  the  output  of  the  GENESIS 
parser  are  given  for  the  example  narratives  discussed  in  chapter  10  and  appendix  B.  Since  parsing  is 
not  the  focus  of  the  current  research,  no  claims  are  made  about  the  parser  and  alternative 
approaches  could  be  used  for  this  task  (e.g.  [HirstST.  MarcusSO.  WaltzS4]). 

8.2.  The  Understander,  Schema  Library,  and  Causal  Model 

The  output  of  the  parser  is  processed  by  the  understander  which  attempts  to  construct  expla¬ 
nations  for  characters'  actions.  The  understander  uses  the  system's  existing  knowledge  of  actions 
and  inferences  stored  in  the  schema  library  in  order  to  find  connections  between  character's  actions 
and  infer  the  ultimate  goals  that  motivated  their  actions.  The  embellished  representation  that  it 
constructs  for  a  narrative  ts  referred  to  as  the  causal  model.  The  explanation  for  a  particular  goal 
achievement  is  the  subset  of  the  causal  mode!  that  supports  the  given  goal  state.  The  representation 
of  actions  used  in  GENESIS  is  an  enhancement  of  the  STRIPS-like  representation  presented  in 
chapter  6  and  :s  discussed  in  chapter  9.  The  techniques  used  in  the  understanding  process  are 
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similar  to  those  used  :n  previous  work  in  narrative  understanding  (Charniak77.  CullingfordTS. 
DeJongSZb.  DverS3.  SchankSlb.  WilenskyS3]  and  are  discussed  m  charter  10. 

8.3.  The  Question-Answerer 

A  simple  question-answerer  is  used  to  inspect  the  causal  model  built  by  the  understander. 
Since  the  focus  of  the  system  is  on  the  construction  of  generalizable  explanations,  this  sub-system 
is  primarily  used  to  answer  questions  about  why  a  character  performed  a  certain  action  or  why  a 
particular  state  exists. 

The  question-answerer  takes  questions  from  the  user  after  they  have  been  parsed  and 
employs  a  number  of  heuristics  for  retrieving  an  answ-er  from  the  causal  model.  For  example,  one 
heuristic  for  answerering  a  question  about  why  a  character  performed  a  particular  action  is  to  find 
another  action  that  this  action  enabled  and  reply  that  the  character  executed  the  action  so  he  could 
perform  the  subsequent  action.  The  answer  to  the  following  question  about  the  kidnapping  narra¬ 
tive  presented  in  cnapter  2  was  produced  using  this  heuristic. 


>Why  did  John  approach  Mary9 
So  John  could  aim  the  gun  at  Mary. 


The  generator  is  used  to  translate  the  question-answerer's  replies  into  English. 

Of  course,  there  are  many  difficult  issues  involved  in  the  retrieval  of  the  most  appropriate 
answers  to  questions  that  the  question-answerer  in  GENESIS  does  not  address  (see  [Lehnert7S. 
WaltzTS]  for  discussions  of  such  issues).  Consequently,  its  answers  may  sometimes  seem  a  little 
strange  or  inappropriate. 

S.4.  The  Paraphraser 

A  paraphrasing  system  is  used  to  output  information  the  system  has  obtained  from  its  pro¬ 
cessing  of  a  story.  The  purpose  of  this  sub-system  is  not  to  concisely  summarize  the  plot  or  the 
point  of  a  narrative  (as  in  [LehnertS2.  WilenskyS2]),  but  rather  to  simply  convey  the  system's 
overall  understanding  of  a  piece  of  text. 

If  a  narrative  is  recognized  as  an  instance  of  a  known  schema  or  if  a  new  schema  is  learned 
from  the  narrative,  that  schema  is  used  to  paraphrase  the  text.  This  is  accomplished  by  printing 
out  the  instantiated  actions  composing  the  schema  in  chronological  order.  In  addition,  before  print¬ 
ing  out  each  action,  any  of  its  preconditions  that  have  not  previously  been  mentioned  are  also 
stated.  The  generator  is  used  to  compose  English  sentences  for  the  assertions  produced  by  the  para¬ 
phraser.  This  technique  for  paraphrasing  an  instance  of  a  schema  is  also  used  to  produce  English 
descriptions  of  general  schemata  (e.g.  the  English  description  of  CaptureBargain  shown  in  the  trace 
:n  chapter  2).  If  a  particular  narrative  cannot  be  interpreted  as  an  instance  of  either  an  old  or  a 
new  schema,  the  paraphraser  simply  generates  English  sentences  for  each  of  the  individual  actions 
m  the  text. 


3.5.  The  Generator  and  Vocabulary 

An  adaptation  of  Mc.MUMBLE  [SchankSlb]  (a  micro-version  of  the  generator  used  in  the 
TALE-SPIN  narrative  generating  system  [Meenan76])  is  used  to  translate  the  output  of  the 
question-ar.sw-erer  and  the  paraphraser  into  natural  language.  Each  predicate  the  system  knows  has 
a  short  program  associated  with  it  which  instructs  it  how-  to  state  an  instance  of  the  predicate  in 
English.  Tnese  programs  are  stored  in  the  vocabulary.  Unfortunately,  like  many  natural  language 
systems.  GENESIS'  knowledge  of  words  is  separated  into  two  different  "lexicons."  one  for  parsing 
a.nc  one  for  generating.  A  system  that  uses  one  lexicon  for  both  tasks  is  presented  in  [WilenskySOj 
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8.6.  The  Schema  Learner 

The  schema  learner  analyzes  the  causal  model  built  by  the  understander  in  order  to  learn  new 
plan  schemata.  It  first  monitors  the  causal  model  and  detects  when  a  character  has  achieved  an 
important  goal  in  a  novel  manner  (i.e.  in  a  manner  for  which  the  system  does  not  already  possess  a 
schema).  The  explanations  for  how  such  goals  are  achieved  can  be  used  to  learn  new  schemata.  In 
addition  to  deciding  when  to  learn,  this  sub-system  includes  an  explanation  pruner,  generalizer.  and 
packager  as  defined  in  chapter  3.1  After  the  explanation  for  the  goal  achievement  is  extracted  from 
the  causal  model,  a  special  pruning  function  is  used  to  remove  actions  and  states  that  only  support 
the  goal  through  more  abstract  concepts.  The  pruned  explanation  is  then  generalized  using  the 
EGGS  explanation  generalization  algorithm.  Next,  an  enhancement  of  the  procedure  for  building 
macro-operator  discussed  in  section  6.2.2  is  used  to  package  the  generalized  explanation  into  a 
schema.  Finally,  this  schema  is  stored  in  the  schema  library  and  indexed  so  that  it  can  be  used  to 
aid  the  understanding  of  future  narratives.  The  pruning,  packaging,  and  indexing  procedures  as 
weil  as  the  procedure  that  decides  when  to  learn  are  all  discussed  in  detail  in  chapter  11. 
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8.7.  The  Word  Learner 

An  additional  feature  that  has  been  added  to  GENESIS  is  the  ability  to  learn  provisional 
definitions  for  unknown  schema-related  words.  For  example,  as  shown  in  the  trace  in  chapter  2. 
GENESIS  acquires,  from  a  single  narrative,  definitions  for  "kidnapper.''  "ransom."  and  "kidnap"  as 
well  as  a  kidnapping  schema.  The  word-learning  process  involves  detecting  the  conceptual  role  that 
an  unknown  word  fills  in  the  schema  that  is  being  learned.  After  a  word  is  associated  with  a  slot 
in  a  new  schema  or  with  the  new  schema  itself,  a  definition  for  the  new  word  is  constructed  for 
both  the  lexicon  and  the  vocabulary.  This  allows  both  the  parser  and  generator  to  use  the  new 
word  for.  respectively,  understanding  narratives  and  answering  questions.  GENESIS'  ability  to 
learn  new  words  is  discussed  in  further  detail  in  chapter  12. 

8-8.  The  Lndexer/Retriever  and  the  Long  Term  Store 

GENESIS  also  has  the  ability  use  the  schemata  it  learns  to  index  and  retrieve  specific  narra¬ 
tives.  After  processing  a  narrative,  the  indexer  stores  the  causal  model  constructed  for  this  piece  of 
text  in  the  long-term  store  and  indexes  it  under  the  most  comprehensive  schemata  used  in 

‘in  PE3LS  (SegreS'ai.  the  :t:c  generalizer  is  used  to  refer  to  the  entire  schema  learning  module:  however,  in  GENESIS, 
this  term  is  used  soiely  for  the  EGGS  explanation  generalization  process. 
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ir.terrretmg  the  story.  When  answering  questions  about  a  particular  narrative,  the  reinevcr  can  be 
instructed  to  retrieve  past  episodes  that  are  indexed  under  the  same  schema  used  to  interpret  the 
current  text.  These  modules  allow  GENESIS  to  function  as  a  conceptual  retrieval  system 
[SchankSla]  which,  during  normal  operation,  automatically  learns  new  ways  to  index  events. 
Further  details  on  this  aspect  of  GENESIS  are  given  in  chapter  13 


8.9.  The  Importance  of  the  Complete  System 


Although  GENESIS  is  primarily  a  tool  for  exploring  learning,  this  chapter  has  revealed  that  it 
contains  a  large  number  of  components  that  have  no  direct  connection  to  learning,  such  as  a  parser, 
question-answerer,  paraphraser.  etc..  An  important  question  concerns  the  relevance  of  these  addi¬ 
tional  components.  While  the  non-learning  components  in  GENESIS  certainly  make  the  learning 
svsiem  easier  to  test  and  debug,  they  also  serve  a  more  important  purpose.  The  primary  goal  of 
learning  is  to  improve  the  performance  of  an  overall  system  in  some  relevant  task  domain.  Several 
AI  researchers  have  stressed  the  importance  of  viewing  learning  in  the  context  of  a  performance 
svstem  [Buchanan77.  Mitchell83.  SimonS3j.  The  requirement  of  improving  overall  system  perfor¬ 
mance  often  imposes  important  constraints  on  a  learning  system.  As  a  result,  certain  degrees  of 
freedom  are  eliminated  that  couid  otherwise  be  unfairly  manipulated  to  bias  the  performance  of  an 


isolated  learning  system.2 


For  exampie.  in  GENESIS,  the  same  explanations  that  support  learning  must  also  be  used  to 
answer  questions  about  the  text,  and  the  schemata  produced  by  the  learning  mechanism  must  be 
able  to  aid  the  understanding  of  future  narratives  as  weil  as  support  reasonable  paraphrases.  Many 
representations  suitable  for  learning  are  unsuitable  for  performing  these  other  tasks.  Most  other 
EBL  systems,  on  the  other  hand,  do  not  use  the  explanations  they  construct  for  any  task  other  than 
to  support  generalization.  In  fact,  most  do  not  even  retain  the  entire  generalized  explanation,  but 
rather  extract  only  a  macro-rule  or  macro-operator  that  summarizes  the  overall  preconditions  and 
cost-conditions.  These  data  structures  do  not  generally  contain  information  on  the  sub-structure 
that  composes  and  justifies  them.3  In  GENESIS,  the  entire  generalized  explanation  is  retained  in 


order  to  be  able  to  use  the  learned  schema  to  understand  future  narratives  and  answer  questions 


about  them. 


2  A  similar  point  is  made  ir.  ■  Falkenr.air.erS7  (section  4.P;. 

JThe  generalized  Triangle  tables  retained  by  STRIPS  do  contain  much  of  this  information.  This  knowledge  is  used  to 
detect  ar.c  correct  problems  encountered  during  execution  monitoring. 
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Complementing  the  impact  that  the  understanding  system  has  on  the  iearning  system  is  the 
impact  that  the  learning  system  has  on  the  understanding  system.  Building  explanations  sufficient 
to  support  generalization  and  schema  acquisition  is  an  important  test  of  how  well  a  text  processing 
system  has  actually  "understood''  a  piece  of  text.  Previous  tasks  that  have  been  used  to  test  under¬ 
standing  are  question-answering  [Lehnert7S],  paraphrasing  [Cullingford7S,  DeJong82b].  and  sum¬ 
marizing  [LehnertS2].  Unlike  these  tasks,  the  ability  to  construct  an  explanation  sufficient  for  gen¬ 
eralization  is  a  crucial  test  of  how  well  the  system  has  comprehended  the  global  causal  structure  of 
a  piece  of  text.  Answering  why  and  how  questions  are  good  ways  of  testing  comprehension  of  local 
causal  structure:  however,  unlike  question-answering,  generalization  requires  the  construction  of  a 
globally  consistent  causal  model.  Paraphrasing  and  summarizing  require  having  an  adequate  global 
structure  for  the  text:  however,  they  do  not  generally  test  the  understanding  of  causal  structure. 
For  example.  FRUMP  [DeJong82b]  produced  summaries  of  newspaper  articles  using  sketchy  scripts 
which  contain  only  temporally  ordered  lists  of  events  with  very  little  causal  information.  In  sum¬ 
mary,  for  mans  narratives  (e.g.  the  examples  in  this  thesis),  it  is  fair  to  say  that  a  system  has  not 
truly  "understood"  the  text  unless  it  has  the  ability  to  produce  an  explanation  that  could  be  gen¬ 
eralized  into  a  new  schema  using  EBL  techniques.  Therefore,  the  ability  to  learn  a  schema  from  a 
narrative  can  be  viewed  as  an  additional  test  of  the  understanding  abilities  of  a  natural  language 
system. 
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CHAPTER  9 

GENESIS:  SCHEMA  REPRESENTATION 


All  domain  specific  knowledge  in  GEXESIS  is  represented  in  a  declarative  fashion  in  the 
schema  library.  This  knowledge  is  divided  into  information  about  objects,  attributes,  states,  and 
actions.  This  chapter  is  comprised  of  separate  discussions  of  the  representations  used  for  each  of 
these  classes  of  knowledge.  A  complete  listing  of  all  of  GEXESIS'  initial  knowledge  is  given  in 
appendix  C. 

9.1.  Objects 

An  isc  hierarchy  is  used  to  specify  a  taxonomy  of  object  classes.  The  complete  GEXESIS 
object  hierarchy  is  shown  in  Figure  9.1.  1  The  system  uses  standard  path-finding  inheritance  tech¬ 
niques  to  determine  class  membership  [Charniak85.  XilssonSO].  If  a  path  is  found,  the  system  con¬ 
structs  a  proof  of  class  membership  based  on  the  path.  This  proof  is  composed  of  inference  rules 
such  as: 

IsaC’x.  Gun)  —  Isa(?x.  Weapon) 

For  efficiency  reasons,  this  specialized  inference  technique  is  preferred  to  giving  the  EGGS  theorem 
proves  explicit  axioms  specifying  the  taxonomy  and  allowing  it  to  construct  proofs  of  class 
membership.  Since  queries  regarding  class  membership  are  quite  frequent,  a  specialized  strategy  for 
these  queries  substantially  increases  the  overall  efficiency  of  the  system. 

9.2.  Attributes  and  States 

Attributes  are  facts  about  objects  that  are  intrinsic  [Kedar-CabeiliS7b]  and  are  not  changed  by 
actions,  while  states  are  facts  about  objects  that  are  affected  by  actions.  GEXESIS’  hierarchies  of 
attributes  and  states  are  shown  in  Figure  9.2  and  Figure  9.3.  respectively.  Like  the  object  isa- 
hierarchy.  these  hierarchies  support  abstraction  inferences  such  as: 

Father('>x.'1v)  -•  Parent(?x.?y) 

Knowledge  of  attributes  and  states  also  includes  backward-chaining  rules  for  inferring  instances  of 
a  predicate  and  forward-chaining  rules  for  making  inferences  from  an  instance  of  a  predicate.  For 

’Special  notice  sr.oulc  be  taken  of  the  distinction  between  a  Character  and  a  Person.  A  Person  is  a  human  being  while  a 
Character  ,s  ar.v  agent  that  is  capaoie  of  executing  a  volitional  action  and  includes  Persons  as  well  as  collect. ve  agents  such  as 
Companies. 
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Figure  9.1:  GENESIS  Hierarchy  of  Objects 
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Figure  9.2:  GENESIS  Hierarchy  of  Attributes 


example,  the  following  rule  is  used  in  a  backward-chaining  fashion  to  infer  that  someone  possesses 
a  particular  amount  of  an  object  if  he  possesses  a  larger  amount  of  that  object: 

Amcunt(7v.?g.?u)  A  Possess(?a.7y)  A  Amount(?x.?l.?u)  A  Isa(7x.?t)  a  IsaC’v.'i’t)  A 
LessThan(7l  .?g)  —  Possess(?a.°x) 

In  this  rule.  Amount(?x.?l.?u)  specifies  that  the  object  7x  consists  of  a  number.  ?1.  of  units.  ?u  (e.g. 
AmountCMoney  1 .  100.  dollars)).  All  of  the  rules  attached  to  states  and  attributes  in  GENESIS  are 
given  in  section  2  of  appendix  B. 

9.3.  Actions 

Actions  specify  dynamic  changes  in  the  state  of  the  world  and  are  also  arranged  in  an  abstrac¬ 
tion  hierarchy.  A  graph  of  the  complete  GENESIS  action  hierarchy  is  shown  in  Figure  9.4.  Actions 
are  divided  into  volitional  actions,  such  as  Murder,  which  are  goal  directed  actions  willingly  exe¬ 
cuted  by  an  actor,  and  non-volitional  actions,  such  as  Die. 

Actions  are  represented  using  an  enhancement  of  the  STRIPS-iike  representation  discussed  in 
section  6.1.  The  first  modification  is  that  preconditions  are  divided  into  constraints,  preconditions. 
and  motivations.  Constraints  are  required  attributes  or  classes  of  the  arguments  of  an  action  ano 
therefore  can  not  be  achieved  by  other  actions.  Preconditions  are  states  that  enable  an  action  and 
can  be  achieved  by  other  actions.  Motivations  are  mental  states  of  the  actor,  such  as  goals,  goal 
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Figure  9.3:  GENESIS  Hierarchy  of  States 
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Figure  9.4:  GENESIS  Hierarchy  of  Actions 
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priorities,  and  beliefs,  that  motivate  him  to  perform  a  volitional  action.’  Together,  constraints, 
preconditions,  and  motivations  are  called  the  supports  of  an  action.  As  in  the  representation  in 
chapter  6.  effects  are  states,  possibly  negated  ones,  resulting  from  the  execution  of  an  action.  An 
action  definition  includes  only  the  supports  and  effects  unique  to  that  particular  action.  Supports 
and  effects  that  are  inherited  from  more  abstract  actions  are  defined  at  that  level.  As  an  example, 
the  supports  and  effects  of  the  Murder  action  and  one  of  its  specializations  (Poison)  are  shown  in 
Table  9.1.  The  effect  stating  that  the  victim  is  dead  is  specified  only  in  the  definition  of  Murder. 
The  preconditions  that  the  actor  must  possess  a  poisonous  food  and  the  victim  must  be  hungry  and 
the  effects  that  the  poisonous  food  is  inside  the  victim  and  no  longer  possessed  by  either  the  actor  or 
victim  are  specific  to  Poison  and  are  specified  only  in  its  definition. 

Knowledge  about  an  action  may  also  include  information  about  its  expansion,  i.e.  its  decom¬ 
position  into  more  primitive  actions.  An  action  can  be  recursively  defined  as  a  macro-operator  that 
has  an  expansion  in  terms  of  other  actions  (which  may  also  be  macro-operators).  The  expansion  of 
an  action  is  also  analogous  to  the  body  of  a  script  [Schank77]  or  MOP  [Schank82].3  The  complete 
set  of  action  specifications  in  GENESIS  should  not  be  interpreted  as  defining  an  ultimate  set  of 
primitive  actions.  Action  definitions  that  do  not  include  expansions  are  not  necessarily  primitive 
actions:  rather,  they  should  generally  be  interpreted  as  currently  lacking  a  complete  definition. 
Although  such  action  definitions  must  eventually  bottom  out  in  a  set  of  primitives,  sets  that  have 
been  proposed  have  many  inadequacies  (Wilks75].  Therefore,  in  GENESIS,  no  commitment  is  made 
regarding  an  ultimate  set  of  primitives. 
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!  Table  9.1:  Sample  Definition  of  Action  Supports  and  Effects  ' 

!  Action 

Constraints 

Preconditions 

Effects  i 

!  Murder('la.?v) 
i  "‘’a  murdered  7v" 

1  Isa(?a.Character) 
Isa(?v. Person) 

j  Dead(?v)  j 

Poison(‘,a.?v.?p) 

:  "'’a  poisoned  ?v  with  ?p" 

j  Poisonous^p) 

!  Food(?p) 

PossessOa.?p) 

NeedSustenance(?v) 

Inside(?p.?v)  , 

-PossessOa.??)  j 
-Possess(7v.7p)  I 

‘The  distinction  between  preconditions  and  motivations  is  used  by  the  question-answerer  when  determining  the  ap¬ 
propriate  answer  to  a  vhy  question. 

,Scr;pts  are  deaned  directly  in  terms  ot'  primitive  conceptual  dependency  actions  [S<hank75i  while  MOPs.  like  ou:  ac¬ 
tions.  are  dehr.ed  recursively. 


The  expansion  of  an  action  schema  includes  subactions .  internal  states.  Links,  abstractions .  and 
ordering  constraints.  Subactions  are  the  set  of  more  primitive  actions  which  compose  the  action. 
Internal  states  (also  referred  to  as  internals)  are  states  that  are  true  sometime  during  the  execution 
of  an  action  but  are  neither  preconditions  nor  effects.  Links  define  the  relationships  among  the 
subactions,  supporters,  and  internal  states,  and  thereby  specify  the  causal  structure  of  the  expan¬ 
sion.  Abstractions  specify  the  relationships  between  facts  in  the  expansion  and  facts  in  the  expan¬ 
sions  of  more  abstract  actions.  Ordering  constraints  (also  referred  to  as  orderings )  are  additional 
constraints  on  the  temporal  ordering  of  the  subactions  that  are  not  implicit  in  the  support  structure 
specified  by  the  links.  These  are  needed  to  avoid  protection  violations  as  discussed  in  section  6.2.2. 
As  an  illustration  of  expansion  definitions,  the  expansions  of  Murder  and  Poison  are  shown  in  Table 
9.2  and  Table  9.3.  respectively.  A  link  of  the  form:  (Effect  (Subaction  2)( Effect  1))  specifies  that 
the  first  effect  of  the  action  is  an  effect  of  the  action's  second  subaction.  Other  links  are  interpreted 
in  a  corresponding  fashion.  An  abstraction  of  the  form  ((Subaction  3)  Murder  (Subaction  2)J 
specifies  that  the  third  subaction  of  the  action  corresponds  to  or  is  a  specialization  of  Murder's 
second  subaction.  Other  abstractions  are  interpreted  in  a  corresponding  manner. 


Table  9.2:  Expansion  of  Murder  1 

!  Subactions 

1  Internals 

Links 

|  GenesisAction(?a) 

,  Die(Tv) 

i 

j  State(?v) 
j  State(?o) 
i  BadHealih(?v) 

i 

i 

(Effect  (Subaction  l)(Internal  1))  j 

(Antecedent  (Internal  3)  (Internal  l))  j 

(Antecedent  (Internal  3)  (Internal  2)) 
(Precondition  (Subaction  2)  (Internal  3))  ! 
(Effect  (Subaction  2)(Effect  1))  j 

Table  9.3:  Expansion  of  Poison 


Subactions 

Internals 

Abstractions 

:  Atrans(?a.?p.?v.?l) 
Ingest(?v.l>p) 

Die(?v) 

Possess(?v.?p) 

BadHealth(7v) 

((Subaction  l)  Murder  (Subaction  1)) 
((Subaction  3)  Murder  (Subaction  2)) 
((Internal  1)  Murder  (Internal  1)) 

((Effect  1)  Murder  (Internal  2)) 

((Internal  2)  Murder  (Internal  3)') 

Links 


(Precondition  (Subaction  1 )  (Precondition  1)) 
(Effect  (Subaction  1)  (Effect  2)) 

(Precondition  (Subaction  2)  (Internal  1)) 
(Effect  (Subaction  2)  (Effect  1)) 

(Antecedent  (Internal  2)  (Constraint  l)) 
(Precondition  (Subaction  3)  (Internal  2))) 


1 


(Effect  (Subaction  l)  (Internal  1)) 
(Constraint  (Subaction  2)  (Constraint  2))  i 
(Motivation  (Subaction  2)  (Precondition  2).)  ; 
(Effect  (Subaction  2)  (Effect  3))  1 

(Antecedent  (Internal  2)  (Effect  1)) 


105 


A  graphical  display  of  the  expansions  of  Poison  and  Murder  and  how  they  relate  to  each  other 
is  shown,  in  Figure  9  5.  The  expansion  of  Murder  can  be  summarized  as  follows:  An  actor.  ?a,  per¬ 
forms  seme  action  that  causes  a  victim,  ?v.  to  be  in  a  state  that  implies  he  is  in  a  state  of  bad  health 
which  in  turn  enables  the  victim's  non-volitional  death.  The  expansion  of  Poison  can  be  summar¬ 
ized  thus:  Ar.  actor,  ?a.  gives  a  poisonous  food.  ?p,  to  a  victim.  ?v.  which  results  in  the  victim  pos¬ 
sessing  the  food.  Since  the  victim  is  hungry  and  has  the  food,  he  eats  the  food  that  results  in  the 
food  being  inside  him.  The  fact  that  the  food  is  poisonous  and  inside  him  implies  that  the  victim  is 
in  a  state  of  bad  health.  This  in  turn  enables  his  non-voiitional  death.  The  actor's  action  of  giving 
the  victim  poisonous  food  corresponds  to  the  actor's  unspecified  action  in  the  expansion  of  Murder 
while  the  state  of  bad  health  and  death  correspond  to  these  same  parts  in  Murder's  expansion.  The 
relationships  between  facts  and  more  general  abstractions  are  represented  as  logical  implications. 
For  example: 

Poison(?a.  ?v,  ??)  —  Murder(?a.  ?v) 

AtransC’a.  ?p.  ?v.  ?1)  —  GenesisAction(?a) 

It  should  be  noted  that  motivations  of  other  characters'  actions  in  the  expansion  of  a  voli¬ 
tional  schema  are  always  preconditions  of  the  overall  schema  or  the  effects  of  actions  performed  by 
the  volitionai  actor  of  the  schema.  For  example,  since  the  fact  that  the  victim  is  hungry  motivates 
him  to  ingest  the  poion.  the  victim  being  hungry  is  a  precondition  of  the  volitional  Poison  schema. 
The  simplifying  assumption  is  made  that  other  actors  will  always  perform  their  subactions  if  the 
motivations  for  these  actions  are  met.  This  insures  that  all  of  the  effects  of  the  action  will  be 
achieved  if  the  volitional  actor  performs  all  of  his  actions. 

A  final  type  of  information  attached  to  actions  is  suggestions.  A  part  of  each  action  definition 
is  a  list  of  iarger  schemata  of  which  that  action  is  a  subaction.  An  action  is  said  to  suggest  schemata 
of  which  it  may  be  a  part  and  which  therefore  may  provide  a  reason  for  its  execution.  This  process 
forms  a  crucial  part  of  the  understanding  mechanism  and  is  discussed  in  detail  in  the  following 
chapter. 

In  conclusion,  it  should  be  noted  that  the  representation  presented  in  this  section  has  beer, 
used  to  represent  a  variety  of  actions  needed  for  the  understanding  of  narrative  text.  Numerous 
additional  examples  of  action  definitions  using  this  representation  are  presented  in  section  1  of 
appendix  C. 
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NeedSustenancel  K 


Poisonous2 


Poison  1 


GeneslsApt 


Posse  ss3 


Deadl 
Murderl 
Poison  1 
Isa  7 

Poisonous! 

NeedSustenance  1 

Possess3 

Insidel 

Ingestl 

Possess4 

Atransl 

Not! 

Not3 

Die3 

BadHealthl 

State2 

Statel 

GenesisActionl 


Dead(°v) 

Murder(?a.?v) 

Poison(?a.?v,'7p) 

Isa(?p.Food) 

PoisonousOp) 

NeedSustenance(?v) 

Possess(?a,°p) 

Inside(?p.?v) 

Ingest(?v,?p) 

Possess(?v,?p) 

Atrans(?a.?p,?v.?l!) 

-Possess(?a.1p) 

-’Possess(?v.',p) 

Die(?v) 

BadHealth(?v) 

State(?p) 

StateC’v) 

GenesisAction(?a) 


•  Murderl 


Statel 


BadHealthl 


Atransl  •»=  Possess-J  -3*Tngestl  Insidel  ->*''State2 


?v  is  dead. 

°a  murdered  °v. 

°a  poisoned  ?v  with  ?p 
?p  is  a  food. 

?p  is  poisonous. 

?v  is  hungry. 

°a  had  ?p. 

?p  is  inside  ?v. 

?v  ate  ?p. 

?v  had  ?p. 

?a  gave  ?v  ?p. 

?a  does  not  have  ?p. 

?v  does  not  have  ?p. 

?v  died. 

?v  is  in  bad  health. 

’’p  is  in  some  state. 

°v  is  in  some  state. 

°a  did  something. 


Figure  9.5:  Expansions  of  Murder  and  Poison 
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CHAPTER  10 

GENESIS:  NARRATIVE  UNDERSTANDING 
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A  substantial  body  of  research  in  natural  language  processing  has  addressed  the  problem  of 
"understanding”  narrative  text.  Most  of  the  research  in  narrative  understanding  has  focussed  on 
constructing  explanations  for  character's  actions  and  inferring  missing  actions  that  are  only  impli¬ 
citly  mentioned  [Charniak85  (chapter  10).  SchankSlb].  Although  these  are  difficult  problems 
which  are  far  from  completely  solved.  a  reasonable  amount  of  progress  has  been  made  and  a 
number  of  useful  mechanisms  have  been  developed. 

In  the  work  on  narrative  understanding,  a  distinction  can  be  made  between  scheme-based  (also 
called  script-based)  and  plan-based  understanding  mechanisms  [Schank77].  A  schema-based  under¬ 
standing  mechanism  attempts  to  directly  and  efficiently  access  a  relatively  specific  knowledge 
structure  (a  schema)  that  accounts  for  the  actions  in  the  text.  This  schema  can  then  be  efficiently 
used  to  causally  connect  actions  in  the  narrative  and  fill  in  missing  actions.  In  terms  of  problem 
soivtng.  it  is  analogous  to  using  a  macro-operator  to  solve  a  particular  problem  in  one  step.  Exam¬ 
ples  of  systems  that  use  schema-based  understanding  are  SAM  fCuI!ingford7S]  and  FRL'MP 
’DeJong"9j. 

A  plan-based  understanding  mechanism,  on  the  other  hand,  can  be  used  for  understanding 
novel  situations.  It  involves  searching  for  a  set  of  missing  actions  that  causally  connect  to  observed 
actions  to  form  a  plan  that  achieves  a  character's  goal.  In  terms  of  problem  solving,  this  approach 
as  more  like  doing  search-intensive  planning  to  achieve  a  goal.  In  Newell's  terminology  [\'ewell73j. 
plan-based  understanding  is  a  weak  method  while  schema-based  understanding  is  a  strong  method. 

PAM  [WilenskyTSj  is  an  example  of  a  system  that  does  plan-based  understanding.  Although 
planning  and  pian-based  understanding  can  make  use  of  the  same  database  of  actions,  they  are 
quite  different  processes.  In  planning,  one  is  given  an  initial  state  and  a  goal  and  must  search  for  a 
plan  that  can  be  executed  in  the  initial  state  and  that  achieves  the  goal.  In  plan-based  understand¬ 
ing.  one  is  given  a  set  of  observed  actions  and  must  search  for  a  plan  that  induces  or  "covers"  these 
actions  and  tnat  achieves  a  character's  goal.  In  understanding,  goals  must  usually  be  inferred  since 
they  are  rarely  explicitly  mentioned. 

Since  a  robust  understanding  system  must  be  able  to  deal  with  both  mundane  and  novel 
situations,  several  recent  understanding  systems  perform  both  schema  and  plan-based 
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understanding  Fcr  example.  BORIS  [DyerS3]  and  FALSTUS  [Norv:gS3.  W'ler.sk\N3;  are  systems 
that  use  noth,  or  these  understanding  mechanisms.  GENESIS  also  employs  both  schema  and  plan- 
cased  mechanisms  in  its  understanding  process.  First,  the  system  tries  to  find  an  action  schema 
that  will  directly  explain  the  characters'  acttons.  If  this  fails,  it  resorts  to  trying  to  causallv  con¬ 
nect  individual  actions  m  a  plan-based  manner.  However,  the  search  it  performs  during  plan-based 
understanding  is  very  limited  in  order  to  prevent  it  from  spending  an  inordinate  amount  of  time 
performing  combir.atorially  explosive  search  through  the  space  of  all  possible  explanations.  Conse¬ 
quently.  GENESIS  cannot  produce  adequate  explanations  for  certain  narratives  although  it  theoreti¬ 
cally  could  understand  them  given  an  exhaustive  search  algorithm  and  unlimited  time  and  space. 
Nevertheless,  the  system's  limited  ability  to  do  plan-based  understanding  allows  it  to  construct 
explanations  for  many  novel  plans  presented  in  narratives.  The  system  then  uses  the  EGGS  gen- 
eraiizer  to  perform  explanation-based  learning  on  these  explanations  in  order  to  learn  new  schemata 
which  allow  it  to  understand  future  instances  of  the  plan  using  schema-based  understanding. 
Explanation-based  learning  in  GENESIS  can  therefore  be  viewed  as  the  acquisition  of  schemata  that 
ailow  the  system  to  process  narratives  using  efficient  schema-based  techniques  which  previously 
couid  only  have  been  understood  using  inefficient,  search-intensive,  plan-based  understanding. 

In  order  to  understand  exactly  how  learning  can  improve  performance  and  allow  GENESIS  to 
construct  explanations  for  narratives  that  it  previously  could  not  comprehend,  detailed  knowledge 
of  the  understanding  process  is  needed.  The  remainder  of  this  chapter  examines  the  schema  and 
plan-based  understanding  mechanisms  in  GENESIS.  A  narrative  used  in  the  acquisition  of  a 
"murder  for  inheritance"  schema  (an  exampie  due  to  Paul  O'Rorke  and  introduced  in  [DeJong53v 
is  used  to  illustrate  these  processes.  The  English  text  and  parsed  version  for  this  narrative  are 
show-n  in  Figure  10. 11  and  additional  action  definitions  needed  for  this  narrative  are  shown  in  Tabie 
10.1.  A  detailed  system  trace  for  this  example  is  presented  in  appendix  D.  The  reader  may  find  ;t 
helpful  to  refer  to  this  trace  while  reading  subsequent  passages  describing  the  processing  of  this 
example. 

10.1.  Schema  Selection 

If  a  schema-based  mechanism  is  to  be  able  to  process  a  broad  range  of  texts,  it  must  have 
access  to  a  large  number  of  schemata.  Therefore,  m  order  to  avoid  repeated  searching  through  the 
entire  database  of  schemata,  it  must  aiso  have  an  efficient  method  for  selecting  the  particular 

•  Va:;ai.«  r  parser  output  sr.cu.d  be  .nterpreteri  as  be:-?  existentially  Quantified. 


aaasss&s&asisie 


.T.T.1 


Claudius  was  Agrippina's  husband  ana  owned  an  estate. 
Agrippina  gave  him  a  poisonous  mushroom  and  he  died. 
She  inherited  the  estate. 


Isa( Person  1. Person),  Gender(Personl.male).  Name(Personl.Claudius).  lsa(Person2.Person). 
Gender(Person2,female).  N'ame(Person2 .Agrippina),  IsaCEstatel  .Estate).  Husband( Person  1 . 
Person2).  Possess(Personl.Estatel).  Poisonous(Mushrooml).  lsa(Mushrooml .Mushroom), 
Atrans(Person2. Mushroom  1. Person  1.7AT6),  Die( Person  1).  lnherit(Person2.Estatel.?FROM3) 

Figure  10.1:  English  and  Parsed  Versions  of  the  Learning  Narrative  for  the  Murder  Example 


Table  10.1:  Additional  Action  Definitions  for  the  Murder  Example 


!  Action 

Constraints 

Preconditions 

Effects 

Suggestions 

j  Atrans(?a.?o.?t.?l) 

■  “?a  gave  ?t  ?o 
i  at  ?r 

Isa(?a.Character) 
IsaOo. Inanimate) 
Isa(?t.Character) 
Isa(?l.Location) 

PossessC’a^o) 

Possess(?t.?o) 

-Possess(?a.?o) 

Atrans(?a.?o.?t) 

Poison(?a.?t.?o) 

j  Die(?a) 

"?a  died" 

Isa(?a. Person) 

.  . 

Dead(?a) 

,  Inhent(?a.?o.?d) 

'  "?a  inherited 

:  ?d's  ?o* 

Isa(?a.Character) 
Isa(?d.Person) 
Isa(?o. Inanimate) 

Heir(?a.?d) 

Possess(?d.?o) 

Dead(?d) 

Possess(?a.?o) 

-Possess(?d.?o) 

schemata  that  are  applicable  to  the  current  input.  This  process  is  frequently  referred  to  as  schema 
selection  or  frame  selection.  It  is  a  difficult  problem  and  has  oeen  the  subject  of  several  research 
efforts  [Charniak78.  CharniakS2.  DeJong82b.  LvtinenS4,  Norvig83]. 

In  GENESIS,  the  process  of  selecting  a  schema  has  two  stages:  suggestion  and  determination. 
An  action  A  in  a  narrative  suggests  a  schema  B  if  A  is  a  subaction  of  B.  Each  suggested  schema  is 
instantiated  (  i.e.  a  unique  instance  is  created  with  the  appropriate  variable  bindings)  and  monitors 
the  inputs  in  order  to  find  confirming  evidence.  When  a  schema  has  found  enough  confirming  evi¬ 
dence.  it  is  determined  and  its  complete  expansion  is  added  to  the  causal  model.2  An  early  discus¬ 
sion  of  this  approach  and  some  of  its  advantages  was  presented  in  [DeJongS4],  The  same  basic  tech¬ 
nique  was  also  used  in  the  original  implementation  of  GENESIS  [MoonevS5a.  MoonevSfb]. 


‘The  lera  determination  is  taker,  from  [Norvig33j  where  it  is  usee  to  refeT  to  a  similar  process  ir.  FAL'STUS.  Sugges¬ 
tion  GENESIS  corresponds  to  the  com o. ration  of  invocation  arc  instantiation  as  dehned  ir.  [Norv;gS3;. 
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10.1.1.  Schema  Suggestion 


A  specification  of  the  suggestion  process  is  shown  in  Figure  10.2.  As  mentioned  in  the  previ¬ 
ous  chapter,  an  action  definition  includes  suggestions  of  relevant  schemata.  For  example,  the 
definition  of  Atrans  given  in  Table  10. 1  includes  the  following  suggestion: 


Atrans(?a.?o.°t.?l)  Poison(?a.?t.?o) 


where  a  statement  of  the  form:  A  B  is  read:  A  suggests  B.  Therefore,  the  input. 

Atrans(Person2.  Mushrooml.  Personl.  ?AT6)  in  the  murder  story  potentially  suggests  the  schema: 
Poison(Person2.  Personl.  Mushrooml).  A  suggestion  is  not  complete,  however,  until  it  is  confirmed 


that  the  suggested  instance  obeys  all  of  the  constraints  on  the  schema.  For  the  example,  the  system 


must  first  prove  the  following  facts  using  the  deductive  retriever  in  EGGS. 


Poisonous(Mushrooml).  IsaCMushrooml.  Food).  Isa(Personl.  Character).  Isa(Person2.  Person) 


The  first  and  fourth  constraints  were  given  explicitly  in  the  input  and  the  second  and  third  ones  are 
easily  inferred  using  the  object  hierarchy.  Therefore,  the  instantiated  Poison  schema  is  added  to  the 


list  of  suggested  schemata  where  it  is  monitored  for  subsequent  determination.3  The  final  step  in 


procedure  SuggestSchemata(i) 

(*  Suggest  schemata  for  input  i  *) 

for  each  suggestion  b  in  the  action  definition  of  i  do 
if  i  unifies  with  a  w-ith  MGU  0 

then 

let  s  =  b0 
let  q  =  true 

for  each  constraint  c  of  s  do 

if  c  does  not  have  any  variables  A  -Retrieve(c) 
then  let  q  =  faise 

if  q 
then 

put  s  on  the  list  of  suggested  schemata 
for  each  super  b  of  action  s  in  the  action  hierarchy  do 
put  b  on  the  list  of  suggested  schemata 


Figure  10.2:  Procedure  for  Suggesting  Schemata 


’For  this  particular  example,  u  may  seem  that  the  input  Poisonous! Mushrooml )  should  directly  suggest  Poison  instead 
f  being  checked  by  a  suggestion  initiated  by  the  Atrans  action.  However,  in  order  to  keep  the  suggestion  mechanism,  simple, 
r.iy  suoactior.s  suggest  mgher- level  schemata  that  can  explain  them.  Allowing  constraints  to  suggest  schemata  would  great- 
y  complicate  the  process  o:  learning  suggestions.  Having  ail  constraints  suggest  a  schema  wouid  result  ;r.  .nputs  suen  as 


ag”? 


Ill 


% 


A 

* 


l- 


K 


/■/ 


/, 


jv 
A 
/  '*» 


3: 


the  suggestion  of  a  schema  is  the  suggestion  of  its  generalizations  in  the  abstraction  hierarchy.  For 
example,  when  the  Posion  schema  is  suggested  in  the  example  narrative,  it  also  causes 
MurderC Person!.  Person l)  to  be  suggested. 

In  general,  a  suggested  instance  of  a  schema  may  contain  existentially  quantified  variables. 
For  example,  if  the  arguments  to  Poison  had  included  a  location,  the  variable  used  to  denote  the 
location  of  the  Atrans  (?AT6)  would  be  included  in  the  suggested  schema.  Constraints  on  such 
variables  (e.g  Isa(?AT6.  Location))  are  not  checked  during  suggestion:  however,  any  constant  that 
:s  later  matched  to  such  variables  must  satisfy  these  constraints,  if  the  EGGS'  deductive  retriever 
were  used  during  suggestion  to  retrieve  constraints  with  variables,  it  would  lead  to  premature 
assignment  of  unknown  schema  variables.  In  the  example.  ?AT6  could  be  immediately  associated 
with  any  previously  mentioned  location. 

10.1.2.  Schema  Determination 

A  suggested  schema  continually  checks  inputs  in  the  narrative  to  see  if  they  match  facts  in  its 
expansion.  This  process  of  examining  suggested  schemata  for  determination  is  described  in  Figure 
10.3.  Each  fact  in  the  expansion  of  a  suggested  schema  is  checked  to  see  if  it  matches  the  input 
fact.  If  an  input  matches  a  suggested  fact  and  all  new  schema  variable  bindings  resulting  from  this 
match  satisfy  the  schema  constraints,  then  these  variable  bindings  are  applied  to  the  schema  expan¬ 
sion.11  The  first  input  that  is  checked  by  this  process  is  the  input  that  suggested  the  schema.  In  the 
example,  this  is  the  Atrans  input  and  it  matches  a  subaction  in  the  expansion  of  the  Posion  schema 
(shown,  in  Figure  10.4). 

Each  time  a  fact  in  the  expansion  of  a  suggested  schema  is  matched  to  an  input,  it  is  marked  as 
having  been  "found."  In  addition,  facts  related  to  the  "found"  fact  are  assumed  to  be  true  and  also 
marked  as  "found."  When  an  action  is  marked  as  "found",  all  of  its  effects,  preconditions,  motiva¬ 
tions.  and  constraints  are  also  marked.  This  is  because  if  an  action  has  occurred,  then  all  of  its  sup¬ 
ports  must  have  been  true  and  all  of  its  effects  must  now  be  true.  When  a  fact  is  marked  as 
"found",  all  of  its  antecedents  (if  it  has  any)  are  also  marked.  If  it  is  the  antecedent  of  another  fact, 
all  of  whose  antecedents  are  now  marked,  this  fact  is  also  marked  since  it  must  be  true  if  all  of  its 

isa1  Personl.  Person)  and  IsaCMushrooml,  Food)  suggesting  Poison,  which  is  clearly  worse.  Distinguishing  between  con- 
s ' r a i r. t s  such  as  Poisonous! Mushroom  1 )  and  Isa( Mushroom],  Food)  would  require  a  complex  analysis  of  the  irequency  of 
occurrence  of  such  constraints. 

FACSTUS,  the  corresponding  process  of  determining  the  values  of  additional  frame  slots  after  instantiation  s 
ra.len  tiooc'-c.'ion  [No-vigSj], 


procedure  CheckSuggestedSchemata(i) 

(*  Check  if  input  1  is  a  part  of  a  suggested  schema  *) 
for  each  suggested  schema  s  do 

for  each  fact  f  in  the  expansion  of  s  do 
if  f  and  i  unify  with  MGU  0  A 

any  newly  bound  schema  variables  in  0  satisfy  the  schema  constraints 
then 

for  each  fact  g  in  the  expansion  of  s  do 
replace  g  with  g 0 
AssumeFound(f) 

if  all  the  subactions  in  the  expansion  of  s  are  "found" 

then  determine  s  (i.e.  add  its  expansion  to  the  causal  model) 

procedure  AssumeFound(f) 

if  f  is  not  already  marked  as  “found"  then 
mark  f  as  "found" 
if  f  is  an  action 
then 

for  each  support  b  and  each  effect  b  of  f  do  AssumeFound(b) 
else 

for  each  antecedent  a  of  f  do  AssumeFound(a) 
for  each  consequent  c  of  which  f  is  an  antecedent  do 
let  q  =  true' 

for  each  antecedent  a  of  c  do 

if  a  is  not  "found"  then  let  q  =  false 
if  q  then  AssumeFour.d(c) 
for  each  action  a  of  which  f  is  an  effect  do 
let  q  =  true 

for  each  effect  e  of  a  do 

if  e  is  not  "found"  then  let  q  =  false 
if  q  then  AssumeFound(a) 
if  f  has  an  abstraction  b  or  is  an  abstraction  of  b 

where  b  is  in  the  expansion  of  another  suggested  schema 
then  AssumeFound(a) 

Figure  10.3:  Procedure  for  Checking  a  Schema  for  Determination 
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NeedSustenancel  w 

Poisonous!  ^a^_Poison  1 


»•  Not! 
»  Not3 


I  \\^r  SenesisA^ri^TT^-^^^State  1 
Possess3  -^Au-ansl  *r"’Possess4  -^"Tngestl  ^  Inside  1  -*r'State2 


— >■  Murder  1 
State  1  \ 


BadHealthl  -»•  Die3 Deadl 


Dead  1 

Dead(  Person  1 ) 

Claudius  is  dead. 

Murderl 

Murderl  Person!, Person  1 ) 

Agrippina  murdered  Claudius. 

$ 

Poisonl 

Poisonl  Person!. Person  1  ..Mushroom  1 ) 

Agrippina  poisoned  Claudius  with  the 

Jy 

mushroom. 

Isa* 

Isa(  Mushroom  1, Food) 

The  mushroom  is  a  food. 

«• 

Poisonous2 

Poisonous!  Mushroom  1 ) 

The  mushroom  is  poisonous. 

N  eedS  ustenancel 

NeedSustenancelPersonl ) 

Claudius  is  hungry. 

*.0 

Possess3 

Possess!  Person!,  Mushroo  ml) 

Agrippina  had  the  mushroom. 

Insidel 

Inside(Mushrooml.Personl ) 

The  mushroom  is  inside  Claudius. 

% 

Ingest  1 

Ingest!  Person  1.  Mushroom  1 ) 

Claudius  ate  the  mushroom. 

si 

Possess-! 

Possess!  Perso  n  1  .Mushroo  ml) 

Claudius  had  the  mushroom. 

Atransl 

Atrans!  Person!, Mushroom  1  .Personl  .712 ) 

Agrippina  gave  Claudius  the  mushroom. 

m 

Not! 

-Possess! Person!, Mushrooml) 

Agrippina  does  not  have  the  mushroom. 

N’ot3 

-Possess(Personl.Mushrooml) 

Claudius  does  not  have  the  mushroom. 

Dieo 

Die!  Personl) 

Claudius  died. 

BadHealthl 

BadHealth!  Personl ) 

Claudius  is  in  bad  health. 

"v* 

State2 

State(.Mushrooml) 

The  mushroom  is  in  some  state. 

ft 

Statel 

State!  Personl) 

Claudius  is  in  some  state. 

Genesis  Action! 

Genesis  Action!  Person! ) 

Agrippina  did  something. 

Figure  10.4:  Expansions  of  Suggested  Poison  and  Murder  Schemata 


antecedents  are.  If  a  marked  state  is  the  effect  of  an  action,  all  of  whose  positive  effects  are  now 
marked,  this  action  is  also  marked.  This  results  in  assuming  suggested  actions  whose  effects  are  all 
known  (or  assumed)  to  be  true.  Finally,  any  facts  in  the  expansions  of  other  suggested  schemata 
that  are  abstractions  or  specializations  of  a  marked  fact  are  also  marked  as  "found."  Whenever  all 
the  subactions  in  the  expansion  of  a  suggested  schema  are  all  marked  as  "found."  that  schema  is 
determined.  This  adds  the  suggested  action  as  well  as  its  supports,  effects,  and  expansion  to  the 
causal  model  where  they  become  part  of  the  system's  ''interpretation"  cf  the  text. 

In  the  murder  example,  when  Atransl  (see  Figure  10.4)  is  found,  it  causes  precondition  Pos- 
sess3.  precondition  Not2.  effect  Possess4.  and  abstractions  Statel  and  GenesisActionl  all  to  be 
marked  as  "found".  The  next  input  in  the  example  is:  Die(Personl ).  This  input  matches  the  fact 
Die3  :n  the  expansions  of  both  the  Poison  and  Murder  schemata.  Although  it  does  not  result  in  any 
new  variable  bindings,  it  causes  all  of  the  other  facts  in  the  expansions  of  Murderl  and  Poisonl  to 


be  marked.  In  particular,  it  marks  the  fact  that  Claudius  ate  the  mushroom  Ingest  1)  because  he 
must  have  been  sick  (BadHealthl)  as  a  precondition  ol  dying  (Die3)  and  this  sickness  can  be 
inferred  from  the  fact  that  the  mushroom  is  poisonous  (Poisonous!;  and  inside  him  (Insidel)  which 
in  turn  is  the  only  positive  effect  of  the  Ingest.  Once  this  missing  action  is  inferred,  the  Poison 
schema  is  determined  since  all  of  its  subactions  (Atransl.  Ingest  1.  and  Die3)  have  either  been  found 
in  the  inputs  or  assumed  to  have  occurred.  The  determination  process  simply  adds  all  of  the  facts 
shown  in  Figure  10.4  to  the  causal  model. 

10.1.3.  Comparison  to  Other  Approaches  to  Schema  Selection 

Although  GENESIS'  understanding  system  is  primarily  intended  as  an  implementation  of 
established  techniques  rather  than  original  research  in  narrative  understanding,  its  approach  to 
schema  selection  is  somew'hat  different  from  previous  methods.  In  general,  its  approach  is  more 
conservative  and  gradual  than  previous  approaches. 

Both  SAM  [Cullingford7S]  and  FRUMP  [DeJong79]  selected  a  schema  based  on  the  conceptual 
representation  of  a  single  input.  SAM  worked  by  "...introducing  the  'largest.'  most  inclusive  script 
it  possesses  which  is  initiated  by  the  first  conceptualization"  [Schank81b  (p.  109)].  Although 
FRUMP  had  many  more  scripts  than  SAM  and  was  not  tied  to  activating  one  from  the  first  input, 
it  also  used  a  single  conceptualization  to  determine  a  relevant  script.  A  discrimination  net  was  used 
to  index  relevant  scripts  based  on  the  content  of  a  single  conceptual  dependency  form  [DeJong79]. 
The  problem  with  this  approach  is  that  it  may  prematurely  decide  on  a  schema  that  does  not  actu¬ 
ally  account  for  the  rest  of  the  inputs.  Since  neither  SAM  nor  FRUMP  was  abie  to  backtrack  and 
consider  alternative  interpretations,  such  a  mistake  results  in  misunderstanding  the  text.  More 
recent  work  has  addressed  the  issue  of  backtracking  and  considering  alternative  interpretations 
after  an  initial  schema  fails  to  account  for  later  inputs  [GrangerSO.  0’Rorke83.  Orejel-Opisso84],  In 
addition,  psychological  studies  indicate  that  humans  perform  backtracking  of  this  sort  when  under¬ 
standing  certain  stones  [CollmsSOj. 

Separating  schema  selection  into  an  initial  suggestion  process  followed  by  a  confirming  process 
of  determination  is  an  attempt  to  minimize  misunderstanding  and  the  need  for  backtracking  by 
oeir.g  mere  cautious  when  selecting  an  initial  schema.  The  system  does  not  commit  itself  to  a  par¬ 
ticular  schema  (i.e.  it  is  not  determined)  until  it  is  clear  that  it  can  account  for  other  inputs  in  the 
narrative.  Also,  the  process  of  inferring  missing  actions  that  complete  a  causal  chain  [SchankSlbj  is 
integrated  with  the  process  of  checking  a  schema  for  determination.  A  schema  is  determined  when 
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ail  of  its  subact-.ons  have  been  observed  or  can  be  inferred  as  filling  gaps  in  a  causal  chain  oi  events. 
However  or.ce  a  schema  is  determined.  GENESIS  is  incapable  of  retracting  it  and  cannot  backtrack 
to  consider  alternative  interpretations. 

Several  otner  recent  understanding  systems  have  also  taken  a  more  cautious  approach  to  ini¬ 
tial  schema  selection.  As  in  GENESIS,  the  basic  approach  is  to  insure  that  a  schema  accounts  for  a 
number  of  inputs  before  actually  selecting  it.  FAUSTLS  has  three  stages  in  its  process  of  selecting 
a  frame.  A  frame  is  first  invoked  then  instantiated  and  finally  determined  [Norvig83].  WIMP 
[CharmakSbj  uses  spreading  activation  to  “suggest"  relevant  frames.  A  process  based  on  Occam's 
Razor  then  selects  the  suggested  frame  that  accounts  for  the  most  inputs  while  making  the  least 
number  of  assumptions.  A  plan  recognition  technique  based  on  non-monotonic  deduction  is  dis¬ 
cussed  in  [KautzS6].  It  also  uses  an  "Occam's  Razor"  approach  which  picks  the  least  number  of 
plans  needed  to  account  for  all  of  the  inputs.  The  understanding  system  used  to  explain  observed 
assembly  sequences  in  the  ARMS  robotics  system  does  not  determine  a  schema  until  all  of  its 
subactions  have  been  directly  observed  [SegreS7a].  This  is  possible  in  the  ARMS  domain  since  one 
is  guaranteed  to  observe  a  complete  sequence  of  low-level  operators.  In  narrative  understanding, 
one  must  have  some  wav  of  inferring  missing  actions  since  stories  rarely  ( if  ever)  mention  all  of 
the  actions  that  took  place. 

10.2.  Causally  Connecting  Actions  Without  a  Schema 

If.  while  processing  a  narrative.  GENESIS  encounters  an  input  action  that  does  not  match  part 
of  an  existing  or  suggested  schema  nor  suggest  any  schemata,  it  attempts  to  explain  the  action  in  a 
more  plan-based  fashion  by  attempting  to  connect  the  new  action  to  the  effects  of  previous  actions. 
The  procedure  GENESIS  uses  to  connect  actions  without  a  schema  is  an  elaboration  of  the  plan 
verification  algorithm  given  in  section  6.2.1.  Since,  in  GENESIS,  this  process  is  only  attempted 
when  schema-based  understanding  cannot  be  used  to  explain  an  action,  a  description  of  it  is  embed¬ 
ded  m  the  complete  understanding  process  shown  in  Figure  10.5. 

The  implicit  goal  of  the  understanding  procedure  is  to  determine  an  action  cr  connected  set  of 
actions  taking  place  in  the  narrative  that  achieve  a  character's  ultimate  goal.  Since  narratives  do 
rot  always  explicitly  reveai  characters'  ultimate  goais.  the  system  must  have  a  way  of  inferring 
them.  An  aspect  of  Scha.nk  and  Abelson's  theory  of  goals  and  themes  [SchankTT]  is  used  to  deter¬ 
mine  suc.i  goais.  In  their  view,  a  goal  arises  either  as  a  subgoal  of  a  plan  to  achieve  a  higher  level 
goal,  or  as  a  result  of  a  theme.  Schank  and  Abelson  define  themes  as:  "containing  the  background 


for  each  parsed  input  1  do 
if  1  is  not  an  action 
then  ProcessFact(i) 
else  ProcessAction(i) 

procedure  ProcessFact(i) 
if  -'Retrieve(i) 
then 

Assert(i) 

CheckSuggestedSchemata(i) 

if  i  does  not  match  part  of  a  suggested  schema 

then 

let  g  =  Retrieve(ThemeGoalMet(',x.  i)) 

if  g  then  call  schema  learner  on  explanation  for  g 

procedure  Process  Action(i) 

Assert(i) 

CheckSuggestedSchemata(i) 

if  i  does  not  match  part  of  a  suggested  schema 

then 

if  i  does  not  achieve  a  thematic  goal 

then 

SuggestSchema  ca(  i ) 

CheckSuggestedSchemataf  i) 
if  i  does  not  suggest  any  schemata 
then 

for  each  support  s  of  i  do 

if  s  is  a  constraint  with  variables 
then  let  r  =  false 
else  let  r  =  Retrieve(s) 
let  0  be  the  MGU  of  s  and  r 
if  r  A  any  newly  bound  schema  variables  in  0 
satisfy  the  schema  constraints 
then 

equate  s  to  r 
replace  i  with  ;0 

for  each  support  b  and  effect  b  of  i  do 
replace  b  with  b0 
else  ProcessFact(s) 
for  each  effect  e  of  x  do 
let  d  =  Retrieve^ -e) 
if  d  then  Delete(d) 

ProcessFact(e) 

for  each  super  b  of  action  i  in  the  action  hierarchy  do 
Process  Action(b) 

Figure  10.5:  The  Understanding  Procedure 
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information  upon  which  we  base  our  prediction  that  an  individual  will  have  a  certain  goal  (p. 
132).  ’  Thematic  goals  are  therefore  defined  as  goals  that  arise  from  basic  human  wants  and  needs 
and  therefore  require  no  further  explanation.  They  are  further  classified  as  either  satisfaction  goals 
such  as  satisfying  hunger  or  sexual  desires,  achievement  goals  such  as  acquiring  money  or  power. 
enjoyment  goals  such  as  wanting  to  be  entertained,  and  preservation  goals  such  as  preserving  one's 
health  or  safety.  GENESIS  has  a  set  of  inference  rules  that  determine  when  a  thematic  goal  has 
been  achieved.  These  inference  rules  effectively  define  a  set  of  known  thematic  goals.  For  example, 
the  following  inference  ruie  defines  the  thematic  goal  of  wanting  to  possess  valuable  items: 

Possess! ?x.?y)  A  Vaiuable(?v)  —  TnemeGoalMet(?x.  Possess(?x.'>v)) 

The  complete  list  of  rules  for  inferring  that  a  thematic  goal  has  been  achieved  is  given  in  section  2 
of  appendix  C. 

The  outer  loop  of  the  understanding  procedure  in  Figure  10.5  sequentially  processes  each  of 
the  inputs  in  the  narrative.  If  an  input  is  not  an  action  and  cannot  be  inferred  from  what  is 
already  known,  then  it  is  first  added  to  the  causal  model.  Next,  suggested  schemata  are  checked  to 
see  if  the  input  matches  a  fact  in  one  of  their  expansions.  Finally,  the  system  tries  to  infer  that  the 
fact  achieves  a  thematic  goal  for  some  character.  If  it  does,  the  explanation  for  how  the  goal  was 
achieved  is  passed  to  the  schema  learner  for  possible  generalization.  In  the  murder  example,  all  but 
the  last  three  inputs  are  facts  that  simply  define  the  various  objects  in  the  narrative  and  the  rela- 
t.onships  between  them.  Ail  of  these  inputs  are  simply  asserted  in  the  causal  model. 

If  an  input  is  an  action,  it  is  first  checked  to  see  if  it  matches  an  action  in  the  expansion  of  a 
suggested  schema.  If  it  does,  this  schema  is  a  likely  explanation  for  the  action  and  the  system 
proceeds  to  the  next  input.  Otherwise,  the  system  checks  if  it  suggests  a  schema  ihat  might  explain 
it.  However,  if  an  action  achieves  a  thematic  goal,  it  is  not  used  to  suggest  schemata  since  the  sys¬ 
tem  already  has  an  explanation  for  why  the  action  was  performed  and  does  not  need  to  search  for 
an  explanation  in  terms  of  it  being  part  of  a  larger  schema.  An  action  is  said  to  achieve  a  thematic 
goal  if  one  of  ;:s  effects  can  be  inferred  as  satisfying  a  thematic  goal  of  the  actor.  This  process  of 
preventing  explained  actions  from  suggesting  schemata  was  first  discussed  in  [MooneyS5b]  but  was 
not  implemented  in  the  original  version  of  GENESIS.  The  importance  of  this  process  is  illustrated 
with  an  example  in  section  B.2.4  of  appendix  B. 

If  an  action  does  net  suggest  a  schema  that  can  explain  it.  then  the  system  attempts  to  connect 
its  preconditions  to  t.-.e  effects  of  previous  actions.  As  :n  the  plan  verification  procedure  given  in 
chapter  6.  the  system  tries  to  infer  each  of  the  supports  of  the  action  using  the  deductive  retriever 
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arc  if  u  fails,  it  simply  assumes  the  support  must  be  true  and  adds  it  to  the  database.  An  excep¬ 
tion  to  this  rule  is  that  if  a  constraint  contains  an  unbound  schema  variable,  then  the  system  does 
not  try  to  infer  that  constraint.  This  is  to  prevent  premature  assignment  of  unknown  variables. 
For  example,  if  a  particular  role  in  the  action  is  not  mentioned  and  this  role  has  a  constraint  that  it 
be  a  Person,  using  the  deductive  retriever  to  satisfy  this  constraint  could  assign  any  known  person 
to  this  role.  If  the  support  is  a  precondition  or  motivation  with  variables,  then  any  schema  vari¬ 
able  bindings  that  are  created  by  matching  it  to  a  known  fact  must  satisfy  the  schema  constraints. 

After  an  action's  preconditions  have  been  considered,  its  effects  are  asserted  and  deletions  are 
determined  as  in  the  procedure  in  section  6.2.1.  The  procedure  ProcessFact  is  used  to  assert  each 
effect  and  check  if  this  state  satisfies  a  thematic  goal  for  some  character.  The  final  step  in  processing 
an  action  is  to  call  ProcessAction  recursively  on  any  abstractions  of  the  input  action  in  the  action 
hierarchy. 

In  the  murder  example.  Agrippinr.a  giving  Claudius  the  poisonous  mushroom  and  Claudius 
dying  is  explained  in  terms  of  Poison  and  Murder  as  described  in  section  10.1.2.  When  an  action 
schema  is  determined,  the  procedure  ProcessAction  is  called  on  the  action  in  an  attempt  to  connect 
it  to  previous  actions.  The  following  facts  are  supports  of  the  Posion  action: 

isa( Mushroom  1.  Food).  Poisonous(Mushrooml). 

NeedSustenance1  Personl).  Possess(?erson2.  Mushrooml) 

The  first  one  is  inferred  from  the  input:  IsaC.Mushrooml.  Mushroom)  using  the  object  hierarchy. 
The  second  directly  matches  a  previous  input.  The  facts  that  Claudius  was  hungry  and  that  Agrip¬ 
pina  hac  the  mushroom  cannot  be  retrieved  and  are  assumed  to  be  true  and  asserted  in  the  data¬ 
base  The  following  facts  are  asserted  as  effects  of  the  Poison: 

Inside!  Mushrooml. Personl).  -Possess!  Person  1 .  Mushrooml).  -Possess! Perscn2.  Mushrooml) 

The  final  fact  deletes  the  precondition  that  was  just  assumed.  When  ProcessAction  is  called  on  the 
determined  Murder  action,  the  only  supports  are  the  constraints: 

isai  Personl.  Person;.  Isa(Person2.  Character) 

both  of  which  are  retrieved  from  input  facts.  The  only  fact  added  as  an  effect  is'  DeadCPerscnl ). 

The  final  input  in  the  example  is:  Inherit! Person2.  Estatel.  ’’FROM3).  which  states  '.hat  Agrip¬ 
pina  inherited  the  estate  from  someone.  This  does  not  match  any  action  in  a  suggested  schema  nor 
does  it  suggest  any  schemata.  Therefore,  the  system  tries  to  connect  it  to  precious  actions.  The 
surrorts  of  the  action  are: 
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Dead(',FR0M3).  Possess(',FROM3.  Esiatel).  Heir(Person2 .  °FR0M3). 

IsaC’FRONH.  Person).  Isa(Person2.  Character).  Isa( Estate  1.  Inanimate) 

The  first  support  matches  the  effect  of  the  Murder.  Dead(Personl)  by  binding  the  variable  TROM3 
to  Personl.  Since  Personl  is  a  Person,  this  binding  satisfies  the  schema  constraint:  Isa(?FROM3. 
Person)  and  this  identification  is  made  throughout  the  action.  The  presence  of  7FROM3  in  the  sup¬ 
ports  also  illustrates  why  constraints  with  variables  are  not  allowed  to  be  retrieved.  If  the  sup¬ 
port:  lsa(?FROM3.  Person)  were  considered  first.  "’FROM3  could  be  incorrectly  bound  to  Person2 
since  Isa(Person2.  Person)  is  in  the  database.  Preconditions  are  generally  less  likely  to  result  in 
such  mistaken  assignments.  A  still  better  approach  might  be  to  find  a  set  of  bindings  that  maximize 
the  number  of  supports  that  can  be  retrieved:  however,  such  a  process  could  require  searching  an 
exponential  number  of  possible  bindings.  Once  7FROM3  is  bound  to  Personl.  all  of  the  constraints 
are  easily  deduced  using  the  object  hierarchy  and  the  precondition  PossessCPersonl.  Estatel) 
matches  an  input.  The  remaining  precondition:  Heir(Person2.  Personl)  is  deduced  from  the  input: 
Husba.nd(Personl .  Person2)  using  the  following  inference  rules: 

HusbandC^x.  ?y)  —  Spouse(7x.  ?y) 

Spouse(7x.  ?y)  —  SpouseOy.  ’’x) 

Spouse(7x.  ?y)  —  He:r(7x.  ?y) 

Finally,  the  following  effects  of  the  Inherit  are  asserted: 

Possess(Person2.  Estatel).  -PossessCPersonl.  Estatel) 

The  second  effect  deletes  the  fact  that  Claudius  owned  the  estate.  The  first  effect  is  recognized  as 
achieving  a  thematic  goal  for  Agrippina  since  the  object  hierarchy  specifies  that  an  estate  is  a  valu¬ 
able  item.  The  explanation  for  how  this  goal  was  achieved  is  shown  in  Figure  10.6.  The  explana¬ 
tion  shown  in  the  figure  is  called  a  highest-level  explanation  since  it  includes  only  the  "largest"  or 
highest-level  schemata  that  support  the  goal.  Actions  that  only  support  the  goal  because  they  are  a 
part  of  the  expansion  of  a  higher-level  schema  are  not  included  in  the  highest-level  explanation. 
Consequently,  the  expansions  of  the  Murder  and  Poison  schemata  are  not  included  in  Figure  10.6. 

The  explanation  constructed  by  the  understanding  process  allows  the  system  to  answer  a 
number  of  questions  about  the  narrative.  Beiow  is  the  input/output  trace  produced  by  GENESIS 
w  her.  it  is  run  on  this  example: 

Input.  Claudius  was  Agrippina's  husband  and  owned  an  estate.  Agrippina  gave  him  a  poiso¬ 
nous  mushroom  and  he  d:ec.  She  inherited  the  estate. 


HusbanJl  — *»  Spousel  — -  Spouse 4  — *■  Heir2  v 


ThemeGoalMetl 
Namel  N'ame2 


ThemeGoai.Me-  1 

ThemeGoalMetl  Person:, 

Agrippina  is  happy  that  Agrippina  has  the 

Possess!  Person:. Estate! )) 

estate. 

Isal’ 

Isal  Estatel  .Valuable) 

The  estate  is  a  valuable. 

Isa3 

Isa<  Estatel  .Estate) 

The  estate  is  an  estate. 

Possess1? 

Possess!  Person;, Estatel ) 

Agrippina  has  the  estate. 

Inner:!  1 

Inherit!  Person:, Estatel, Personl ) 

Agrippina  inherited  Claudius's  estate. 

Isa  1 5 

Isa!  Estatel  .Inanimate) 

The  estate  is  an  inanimate  object. 

Isa : 

Isa!  Personl  .Person) 

Claudius  is  a  person. 

Isa  1 0 

Isal  Person:. Character) 

Agrippina  is  a  character. 

Isa2 

Isa!  Person:. Person) 

Agrippina  is  a  person. 

Dead  I 

DeadlPersonl ) 

Claudius  is  dead. 

Murder; 

Murder!  Person:,  Personl ) 

Agrippina  murdered  Claudius. 

Poisor.  1 

Poison!  Person:. Personl  ..Mushrooml ) 

Agrippina  poisoned  Claudius  viih  the 

mushroom. 

Isaf 

Isat  Mushrooml. Food) 

The  mushroom  is  a  food. 

Isa4 

'.sai  Mushrooml. Mushroom) 

The  mushroom  is  a  mushroom. 

?0;sor.ousl 

?o:so  nous!  Mushroom  1 ) 

The  mushroom  is  poisonous. 

NeedSustenar.ee; 

NeedSustenance!  Person  1 ) 

Claudius  is  hungry. 

Possess; 

Possess!  PersonLMushroomi ) 

Agrippina  had  the  mushroom. 

Inside! 

Inside!  Mushroom  1  .Personl ) 

The  mushroom  is  inside  Claudius. 

Not: 

-Possess!  Person:, Mushrooml ) 

Agrippina  does  not  have  the  mushroom. 

Notj 

-  Possess!  Person  1  ..Mushroom  1 ) 

Claudius  does  not  have  the  mushroom. 

Possess! 

Possess!  Personl  .Estate! ) 

Claudius  had  the  estate. 

Heir: 

Heir!  Person:. Personl ) 

Agrippina  is  Claudius’s  heir. 

Spoused 

Spouse!  Person:. Personl ) 

Agrippina  is  Claudius's  spouse. 

Spouse! 

Spouse!  Perso  n  1  .Person: ) 

Claudius  is  Agrippina's  spouse. 

Husband! 

Husband!  Person  1  .Person: ) 

Claudius  is  Agrippina's  husband. 

Not6 

-Possess! Personl, Estatel ) 

Claudius  does  not  have  the  estate. 

Gender! 

Gender!  Personl. male) 

Claudius  is  male. 

Gender: 

Gender!  Person:, female) 

Agrippina  is  female. 

Namel 

Name! Personl  .Claudius) 

Claudius's  name  is  Claudius. 

Name: 

Name!  Person:, Agrippina) 

Agrippina's  name  is  Agrippina. 

Figure  10.6:  Murder  Example  -  Specific  Explanation  (Highest-Level) 


Thematic  goal  achieved:  Agrippina  is  happy  that  Agrippina  has  the  estate 
Explanation  suitable  for  generalization.  Pruning. ..Generalizing. ..Packaging... 

Creating  Neve  Schema:  (.Murderlnherit  "z5  ?o4  '’dl) 

?a5  is  a  character.  ?dl  is  a  person.  °a5  murders  ?dl.  ?o4  is  an  inanimate  object.  ?o4  is  a  valu¬ 
able.  ?dl  has  ?o4.  7a5  is  7dl's  heir.  ?a5  inherits  Til's  '1o4. 

Having:  ?a5  murdered  ?dl.  suggest  Murderlnhent 

Ready  for  questions: 

>  Summarize 

Agrippina  murdered  Claudius.  Claudius  had  an  estate.  Agrippina  was  Claudius's  heir.  Agrip¬ 
pina  inherited  Claudius's  estate. 

>  Why  did  Agrippina  give  Claudius  the  mushroom? 

So  Claudius  would  eat  the  mushroom  and  so  Claudius  would  die. 

>  Why  did  Claudius  eat  the  mushroom'* 

Because  Claudius  was  hungry. 

>  Why  is  Claudius  dead7 

Because  Agrippina  poisoned  Claudius  with  the  mushroom. 

>  Why  did  Agrippina  kill  Claudius'* 

So  Agrippina  couid  inherit  Claudius's  estate. 

>  Why  did  Agrippina  inherit  the  estate'* 

Because  Claudius  was  Agrippina's  husband,  because  Claudius  bad  the  estate  and  because  Agrip¬ 
pina  poisoned  Claudius  with  the  mushroom. 

>  Why  is  Agrippina  Claudius's  heir'* 

Because  Claudius  was  Agrippina's  husband. 


10.3.  Limiting  Search  in  Lnderstandmg 

GH.MiSIS's  understanding  process  is  built  on  the  assumption  that  combmatcrially  explosive 
search  through  possible  explanations  should  be  avoided.  PAM  [Wilensky'S],  on  the  other  hand,  is 
an  explanation-based  understanding  system  that  searches  for  an  explanation  of  an  action  until  it 
exhausts  its  applicable  knowledge.  PAM  searches  for  an  explanation  of  a  character's  action  bv 
predicting  future  actions  the  character  might  take  and  seeing  if  any  of  them  satisfy  a  known  goal 
for  that  character.  In  terms  of  the  representation  presented  here,  when  PAM  processes  an  action,  it 
searches  through  possible  next  actions  that  have  preconditions  that  match  the  effects  of  the  previ¬ 
ous  action.  For  example,  consider  the  following  narrative,  which  is  normally  used  as  an  example  of 
how  PAM  processes  a  narrative. 

Willa  was  hungry.  She  picked  up  the  Michelin  guide  and  got  in  her  car. 

When  the  system  processes  the  action  that  Willa  has  picked  up  the  Michelin  guide,  it  considers  that 
the  next  action  she  might  perform  is  to  read  the  guide  since  possessing  a  book  is  a  precondition  for 
reading  it.  The  effect  of  reading  the  Michelin  guide  is  that  she  will  know  the  location  of  a  restau¬ 
rant.  Since  knowing  the  location  of  a  restaurant  is  a  precondition  for  going  to  a  restaurant,  it  con¬ 
siders  the  nex;  action  she  might  take  is  to  go  to  a  restaurant.  Finally,  since  being  at  a  restaurant  is 
a  precondition  for  eating  there,  it  considers  eating  at  the  restaurant  to  be  a  possible  next  action. 
This  action  is  realized  as  satisfying  her  hunger  and  therefore  the  chain  of  actions  just  discussed  is 
considered  to  be  the  explanation  for  her  actions.  If  at  some  time  the  system  cannot  propose  a  possi¬ 
ble  next  action  due  to  lack  of  knowledge  of  an  action  that  has  a  precondition  that  matches  the  pre¬ 
vious  action  s  effect,  then  it  backtracks  to  the  last  choice  and  picks  another  possible  action.  This 
search  is  continued  until  the  system  encounters  an  action  that  satisfies  a  known  goal  or  until  it  has 
exhausted  its  know  ledge  and  cannot  predict  any  more  possible  future  actions. 

However,  searching  through  a  space  of  possibie  future  actions  is  combinatorially  explosive. 
Consequently,  such  an  approach  is  intractable  if  a  system's  knowledge  of  actions  is  large,  which  it 
obviously  must  be  if  it  is  to  be  able  to  understand  a  wide  range  of  narratives.  Consider  what  a 
?AM-'uke  system  would  do  if  it  where  used  to  process  the  murder-for-inheritance  narrative.  L’pon 
determining  that  Agrippina  had  murdered  Claudius,  it  wouid  conduct  an  exhaustive  searcn  for  an 
explanation  of  this  action.  Such  a  system  would  spend  a  long  time  exploring  possible  courses  of 
action  that  Agrippina  could  take  now-  that  Claudius  is  dead.  She  could  bury  him.  sne  could  cre¬ 
mate  him.  she  could  donate  his  body  to  medicai  science,  etc..  A  svstem  with  a  large  knowledge  base 
::  actions  cc uld  search,  a  long  time  before  stumbling  upon  the  idea  of  her  inheriting  his  propertv 


■e.  mw.v; 
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Currently,  in  order  to  avoid  such  combinatonallv  explosive  search.  GENESIS  does  not  try  to 
predict  future  lor  past)  courses  of  action  in  this  manner.  If  an  action  does  not  suggest  a  known 
schema,  the  system  only  tries  to  connect  its  preconditions  to  the  effects  of  a  previous  action.  Of 
course,  suggesting  schemata  can  be  seen  as  predicting  possible  future  or  past  actions.  In  fact,  if  sug¬ 
gested  schemata  were  immediately  used  to  recursively  suggest  other  schemata,  the  system  would 
simply  be  conducting  combinatorialiy  explosive  search  through  a  different  space.5  However,  in 
GENESIS,  a  suggested  schema  is  not  used  to  suggest  other  schemata  until  it  has  accounted  for  addi¬ 
tional  actions  and  is  determined.  Therefore.  GENESIS’  conservative  approach  to  schema  selection 
greatly  limits  the  search  it  performs  during  this  process. 

Overall.  GENESIS  performs  relatively  little  search  during  understanding.  Its  approach  to 
understanding  is  for  the  most  part  non-predicti\e  [Segre87a].  or  "wait  and  see"  [Marcus74],  In 
fact.  GENESIS  is  a  little  too  conservative  with  regards  to  searching  for  an  explanation.  Performing 
search  for  missing  actions  is  clearly  necessary  for  understanding  narratives  such  as  the  PAM  exam¬ 
ple  shown  above.  But  searching  for  missing  actions  must  be  tightly  constrained  since  it  quickly 
becomes  intractable. 

Sir.ce  GENESIS  does  not  conduct  a  complete  search  for  an  explanation,  it  is  incapable  of 
"understanding"  narratives  that  have  missing  actions  and  do  not  suggest  known  schemata.  For 
example,  consider  the  processing  of  the  following  test  narrative  for  the  murder  example  before  the 
system  learns  a  murder-for-inheritance  schema. 

Input:  Martha  is  Gene's  mother  and  is  a  millionaire.  He  shot  her  and  she  died.  Gene  got 

SI  000000. 

Ready  for  questions: 

>  Summarize 

Gene  snot  Martha  and  killed  her. 

s?’.ar.-basec  understanding  as  performed  by  PA.M  can  be  modelled  as  schema  selection  if  each  action  nas  an  additional 
subaction  for  achieving  each  of  its  preconditions.  In  this  case,  if  an  action  A  meets  a  precondition  for  action  3.  then  A  must 
be  oe  a  subactior.  of  3  and  therefore  B  wouid  oe  suggested  as  a  possible  reason  for  performing  A.  If  suggested  schemata  re¬ 
cursively  suggest  other  scr.emata.  the  result  is  the  same  search  through  future  actions  that  PAM  performs.  A  scrtena  is 
selected  if  it  is  tne  final  action  in  a  chain  of  actions  that  begins  with  the  observed  action  and  ends  in  the  acnievement  o:  a 
thematic  goal.  The  formulation  of  morivcrion  analysis  presented  in  .CharnialcS5  (chapter  10);  car.  be  viewed  as  tailing  this 
approach. 


>  Why  did  Gene  shoot  Martha? 
So  Martha  would  die. 
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>  Why  did  Gene  kill  Martha’’ 
Cannot  find  sufficient  reason. 


i 


V. 


V. 


I 


I 

f 


>  Why  did  Martha  die7 

Because  Gene  shot  Martha  and  killed  her. 


>  How  did  Gene  get  the  money? 
Cannot  find  sufficient  cause. 


The  parsed  representation  of  this  narrative  is  shown  in  Figure  10.7.  The  crucial  difference  between  1 

this  story  and  the  Claudius  and  Agrippina  story  is  that  the  current  narrative  only  states  that  Gene 
has  the  money.  It  is  missing  the  action  declaring  that  he  inherited  it.  A  couple  of  additional 
actions  used  in  the  processing  of  this  narrative  are  Shoot  and  ShooiToK.il!  Ca  specialization  of 
Murder).  Definitions  for  these  actions  are  shown  in  Table  10.2  and  Table  10.3.  I 


When  processing  this  story,  the  input:  Shoot(Person2.  Personl.  7I\STR6)  suggests  the  schema: 
ShcctToKilKPerson2.  Personl.  ?I.\STR6).  The  subsequent  input  is  Die(Personl).  which  matches  a 


_ _  i 

!sa( Personl. Person).  Gender!  Personl  .female),  Xame( Personl  .Martha). 

Isa(Person2. Person).  Gender(Person2.male).  N'ame(Person2.Gene).  Isa(Monevl  .Money). 

Amount!  Money  1.  order-millions,  do  liar).  Mother!  Person  1  .Person2).  Possess!  Personl. Money  I ). 

Shoot! Person2. Personl  ,?I.\STR6).  Die! Person  I ).  Isa! Monev2, Money). 

Amount(Money2.lOOOOOO.dollar).  Possess! Person2,Monev2) 

Figure  10.7:  Parsed  Version  of  the  Murder  Test  Narrative 

- - -  1 

t 


Table  10.2:  Additional  Definitions  of  Action  Supports  and  Effects 


Action 

Constraints 

Preconditions 

Effects  • 

i  Shoot!?a.7v.’’g) 

Isa(?g.Gun) 

i  Possess! ?a.7g) 

Shot(?v)  | 

"’’a  shot  ?v  with  7e" 

Isa(?a. Person)  ! 

PointingAt(7a.?g.7v) 

Shoo  tTo  Kill!  °a.?v.?g) 

"’’a  shot  7v  and  killed  it" 

IsaC’g.Gun) 

i 

Possess!  ?a.?g) 

Shot!?v)  j 

Table  10.3:  Expansion  of  ShootToKill 


p 

Subact.ons  Internals 

Links 

V 

■i 

Shooti7a.'>v.',g)  ;  BadHealth(?v) 

;  DieCv)  j 

1  ! 

I 

(Precondition  (Subactior.  1)  (Precondition  1)) 
(Effect  (Subaction  l)  (Effect  1)) 

(Antecedent  (Internal  1)  (Effect  1)) 
(Precondition  (Subaction  2)  (Internal  1))) 

i 

Abstractions 

.v 

('.Effect  l)  Murder  (Internal  1)) 

('  Internal  1 )  Murder  (Internal  3)) 
Suoaction  1'  Murder  (Subaction 

1)))) 

((Effect  1)  Murder  (Internal  2)) 

((Subaction  2)  Murder  (Subaction  2)) 

subaction  in  the  suggested  ShootToKill  schema.  Since  both  of  the  subactions  of  the  suggested 
schema  have  been  explicitly  mentioned,  it  is  determined  and  added  to  the  causal  model.  The  facts 
that  'TXSTR6  is  a  gun  and  that  Gene  possesses  it  are  assumed  since  they  are  supports  of  the  deter¬ 
mined  schema.  The  final  input  stating  that  Gene  has  SI. 000, 000  does  not  match  part  of  a  suggested 
schema  and  is  simply  added  to  the  causal  model  as  a  disconnected  fact.  The  final  causal  model  con¬ 
structed  for  this  narrative  is  shown  in  Figure  10. S. 

The  causal  connections  added  by  the  ShootToKill  schema  allows  the  system  to  answer  a  cou¬ 
ple  of  questions  about  the  narrative:  however,  the  causal  model  is  not  sufficient  to  explain  why 
Gene  killed  Mary  or  how  he  got  the  money.  Without  a  schema  for  murder-for-inneritance.  the 
system  cannot  infer  the  missing  Inherit  action  since  it  does  not  conduct  a  potentially  expensive 
search  for  an  action  that  would  connect  Gene's  acquisition  to  the  rest  of  the  narrative. 


Motherl 


Paremi 


PositivelPTl 


j  GenesisAction2 


Possess3  Shoot  1 


Shot2 


ShootToKilll 


State3 

^State4_j=4i>  BadHealthl  — *■  Die3  v 
Lsa2  — *■  I  sag  \ 

- k**-  Murder2  /7 


Genderl  Namel  Ainountl  Possessl 

Gender2  Name2  Amount2  Possessd 


Deadl 

Dead(  Person  1 ) 

Martha  is  dead. 

Murder2 

Murder(  Person:. Person  1 ) 

Gene  murdered  Martha. 

ShootToKilll 

ShootToKilll  Person:, Person  1  .’’INSTR6 ) 

Gene  shot  Martha  and  killed  her. 

Isa  7 

Isal  TNSTR6,  Gun) 

The  gun  is  a  gun. 

?ossess3 

Possessl  Person2, 7INSTR6 ) 

Gene  has  the  gun. 

Snot2 

Shot!  Person  1) 

Martha  is  shot. 

Shoot  1 

Shoot(Person2,Personl,?INSTR6) 

Gene  shot  Martha  vith  the  gun. 

Isal 

Isal  Person  1  .Person ) 

Martha  is  a  person. 

Isa8 

IsalPerson2, Character) 

Gene  is  a  character. 

Isa2 

Isal  Person:  .Person ) 

Gene  is  a  person. 

Diei 

DielPersonl ) 

Martha  died. 

BadHeaithl 

3adHealth(Personl ) 

Martha  is  in  bad  health. 

Stated 

State(Personl) 

Martha  is  in  some  state. 

Statej 

State!  Personl) 

Martha  is  in  some  state. 

Genes:sAct:on2 

GenesisActionl  Person: ) 

Gene  did  something. 

PositivelPTl 

PositivelPTl  Person  1  .Person: ) 

Martha  has  a  positive  relationship  vc'ith  Gene 

Parent  1 

Parent!  Person  1  .Person: ) 

Martha  is  Gene's  parent. 

Motherl 

Motherl  Person  1  .Person: ) 

Martha  is  Gene's  mother. 

Isa3 

Isal  Money  1  .Money) 

Millions  of  dollars  is  money. 

IsalO 

Isal  Money  2, Money) 

The  S1000000  is  money. 

Genderl 

Gender*  Personl. female) 

Martha  is  female. 

Gender2 

Genderl  Person:. male) 

Gene  is  male. 

N’amel 

Name!  Person  1  .Martha ) 

Martha’s  name  is  Martha. 

N’ame2 

Namel  Person:. Gene) 

Gene's  name  is  Gene. 

Amount  1 

AmountlMonevl. order- millions, do  liar) 

Millions  of  dollars  is  order-millions  dollars 

of  money. 

Amount: 

Amount!  Money:,  1000000. dollar) 

The  S1000000  is  1000000  dollars  of  money. 

Possess! 

Possessl  Personl. Money  1 ) 

Martha  has  millions  of  dollars. 

Possessd 

Possessl  Person:. Money:) 

Gene  has  the  SI 000000. 

Figure  10.8:  Causal  Model  for  Murder  Test  Example  Before  Learning 


CHAPTER  11 


GENESIS:  SCHEMA  ACQUISITION 


If  the  understander  is  capable  cf  constructing  an  explanation  for  how  a  goal  was  achieved. 
GENESIS  may  be  able  to  use  EGGS  to  generalize  this  explanation  and  learn  a  new  schema  which  can 
aid  in  the  understanding  of  future  narratives.  First,  the  system  must  decide  whether  or  not  an 
explanation  is  worth  generalizing.  This  process  is  described  in  section  11.1.  Next,  a  special  pruning 
procedure  for  EGGS  eliminates  overly  specific  parts  of  the  explanation.  The  pruner  used  in 
GENESIS  is  described  in.  section  11.2.  Next,  the  EGGS  generalizer  is  used  to  generalize  the  resulting 
explanation.  In  addition  to  this  process,  a  couple  of  additional  constraints  must  be  added  to  insure 
that  all  actions  are  motivated.  This  process  is  discussed  in  section  11.3.  Finally,  a  special  packager 
for  EGGS  constructs  a  new  schema  from  the  generalized  explanation  and  indexes  the  schema  so  that 
its  subactions  suggest  it.  This  packager  is  an  enhancement  of  the  procedure  for  extracting 
partiailv-ordered  macro-operators  (see  section  6.2.2)  and  is  discussed  in  section  11.4.  In  addition,  a 
description  of  how  a  learned  schema  can  aid  the  understanding  of  future  narratives  is  presented  in 
section  11.5.  Throughout  the  chapter,  the  learning  of  the  murder-for-inheritance  schema  is  used  as 
an  example. 

11.1.  Deciding  UTien  to  Learn 

If  every  explanation  GENESIS  encountered  was  generalized  into  a  new  schema,  the  system 
would  scon  become  overloaded  with  rarely  used  schemata.  Most  actions  would  suggest  a  large 
number  of  schemata  and  selecting  among  these  would  require  an  excessive  amount  of  processing 
time.  This  problem  is  analogous  to  the  phenomenon  of  slowdown  in  problem  solving  systems  that 
learn  macro-rules  (see  section  7.2). 

One  approach  to  this  problem  is  to  generalize  all  explanations  and  later  eliminate  those  sche¬ 
mata  that  are  empirically  found  to  be  infrequently  used.  In  the  PEBLS  model  [SegreSTa].  the  con¬ 
ditions  used  to  determine  which  schemata  are  retained  are  called  the  retention  criteria.  An  example 
cf  this  approach  is  the  learning  of  S-Macros  in  MORRIS  [MintonS5],  which  was  discussed  in 
chapter  7.  Another  approach  is  to  selectively  learn  those  schemata  that  are  judged  likely  to  be  use¬ 
ful  in  latter  processing,  in  the  PEBLS  model,  the  conditions  used  to  determine  whether  or  not  an 
explanation  is  worth  generalizing  are  called  the  learning  criteria.  The  learning  of  T-Macros  in 
MORRIS,  also  mentioned  in  chapter  7.  is  an  example  of  this  approach.  Of  course,  these  two 
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approaches  to  determining  utility  are  not  necessarily  mutually  exclusive  and  MORRIS  and  PRO- 
DIG'S'  [MmtcnJwa.  MmionS”b]  employ  both  techniques. 

Currently.  GENESIS  only  has  criteria  for  selectively  learning  schemata.  Below  are  the  set  of 
criteria  that  GENESIS  uses  to  determine  whether  an  explanation  is  worth  generalizing 

(1)  It  should  be  an  explanation  of  how  a  thematic  goal  was  achieved. 

(2)  The  highest-level  explanation  for  the  goal  achievement  should  not  be  simply  an  instantiation 
of  a  known  schema. 

13)  Ail  actions  in  the  highest-level  explanation  should  rely  on  the  character  whose  thematic  goal 
was  achieved. 

The  first  criterion  is  crucial  for  insuring  that  the  acquired  schema  will  be  a  useful  one.  A 
method  for  achieving  a  state  that  satisfies  normal  human  wants  and  desires  is  likely  to  make  a 
schema  that  will  arise  again  and  again.  Recall  that  thematic  goals  were  defined  as  the  highest  level 
goals  that  motivate  a  character's  action.  If  an  action  achieves  a  thematic  goal  for  its  actor,  it 
requires  no  further  explanation.  Consequently,  an  explanation  is  considered  to  be  worth  generaliz¬ 
ing  into  a  new  schema  only  if  it  achieves  a  thematic  goal.  Since  the  understander  detects  the 
achievement  of  thematic  goals  and  passes  them  on  to  the  schema  learner  (see  section  10.2).  the 
schema  learner  itself  does  not  need  to  explicitly  check  this  criterion.  In  the  murder  example,  the 
expianation  sho'wn  in  Figure  10.6  satisfies  this  criterion  since  it  is  an  explanation  of  how  Agrippina 
achieved  her  thematic  goal  of  possessing  a  valuable  item,  namely.  Claudius’  estate. 

The  second  criterion  is  the  obvious  one  of  not  already  possessing  a  schema  for  the  combination 
of  actions  that  achieves  the  goal.  This  simply  involves  checking  the  highest-level  explanation  for 
the  goal  achievement  to  make  sure  it  contains  several  different  actions.  If  the  system  already  had  a 
schema  for  this  case,  it  w'ould  have  used  it  in  processing  the  narrative  and  the  goal  achievement 
would  be  explained  at  the  highest  level  by  an  instance  of  this  schema  instead  of  by  a  combination 
of  several  actions.  In  the  murder  example,  the  highest-level  explanation  satisfies  this  criterion  since 
it  is  composed  of  a  Poison  and  an  Inherit  action.  It  should  be  noted  that  actions  like  Murder,  which 
are  generalizations  of  other  actions  in  the  explanation,  do  not  count  as  separate  actions. 

The  third  and  final  criterion  insures  that  the  schema  that  is  learned  from  the  explanation  is  a 
volitional  action  that  a  character  can  use  to  achieve  his  own  thematic  goal.  Let  the  term  main  char¬ 
acter  refer  to  the  character  whose  thematic  goa!  was  achieved.  If  the  learned  schema  is  to  represent 
a  rian  that  can  be  executed  by  the  main  character  to  achieve  his  tnematic  goal,  actions  in  the 
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schema  that  are  no:  vclitionaliy  performed  by  :he  mam  character  should  a;  !eas:  be  moir. ated  by 
actions  that  he  performs.  In  order  to  insure  this,  ncn-voiiuor.al  actions  m  the  explanation  must 
have  a  precondition  supported  by  a  volitional  action  performed  by  the  main  character.  For  exam¬ 
ple.  the  explanation  for  the  murder  story  contains  the  non-volitional  action  Inherit,  hove  ever,  the 
precondition  of  this  action  that  states  that  the  benefactor  must  be  dead  is  achieved  by  Agrippina's 
Murder  action  and  Agrippina  is  the  main  character  since  :t  is  her  thematic  goal  that  is  achieved. 
Also,  volitional  actions  in  the  highest-level  explanation  performed  by  actors  other  than  the  main 
character  must  have  a  motivation  supported  by  an  action  performed  by  the  main  character.  For 
example,  in  the  learning  of  an  arson-for-insurance  schema  (an  example  presented  in  appendix  B), 
someone  burns  his  own  warehouse,  which  causes  his  insurance  company  to  reimburse  him  for  the 
loss.  The  explanation  for  this  example  can  be  generalized  into  a  volitional  schema  Even  though  the 
volitional  Indemnify  action  in  the  explanation  is  performed  by'  another  actor  (i.e.  Prudential),  this 
action  is  motivated  by  the  fact  that  they  believe  his  barn  is  burnt,  which  in  turn  is  supported  by 
the  Burn  action  performed  by  the  main  character  (see  section  *B.2.3  for  details). 

If  the  third  criterion  were  not  used,  the  system  could  learn  schemata  that  contain  serendipi¬ 
tous  actions  over  which  the  main  character  has  no  control.  For  example,  assume  GEXES1S  had  the 
knowledge  to  explain  the  following  narrative  but  did  not  have  the  third  learning  criterion. 

John's  rich  uncle  was  killed  in  an  earthquake.  John  inherited  a  million  dollars. 

Such  a  system  would  acquire  an  Earthquakelnherit  schema  (or  possibly  a  NaturalDisasterlnhent 
schema)  from  us  explanation  of  this  narrative.  Such  a  non-volitional  schema  would  prebabiy  not 
be  that  useful  for  understanding  later  narratives. 


11.2.  Pruning  the  Explanation 

w. 

If  an  explanation  meets  all  three  learning  criteria.  GENESIS  proceeds  to  generalize  it  into  a 
new  schema.  In  order  to  increase  to  the  generality  and  applicability  of  the  resulting  schema. 
3-)  GENESIS  has  a  pruning  procedure  that  removes  unnecessarily  specific  branches  from  the  explana¬ 

tion  prior  to  generalization.  If  such  pruning  were  not  performed,  the  system  would  frequently 
•V  learn  schemata  that  are  too  restrictive  to  be  useful.  For  example,  if  the  explanation  for  the  Clau¬ 

dius  and  Agripp.na  narrative  were  not  pruned,  the  resulting  schema  would  only  cover  cases  in 
V  which  a  wife  poisoned  her  husband  with  a  mushroom  in  order  to  inherit  an  estate. 

A  description  of  the  GENESIS  pruning  algorithm  is  given  in  Figure  11.1.  The  fact  that  only 
the  highest-ievei  explanation  is  considered  during  generalization  is  in  itself  a  type  of  pruning  This 
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for  each  inference  rule  ar..a„  —  c  m  the  highest-level  explanation  structure  ac 


if  PrunabieTvue(a,...a  — c) 

*  *  1  n 


then 

let  q  =  true 

for  each  fact  f  which  eventually  supports  c  do 
for  each  path  p  from  f  to  the  goal  do 
let  r  =  true 

for  each  inference  rule  b,...b  —  c  in  p  do 

1  rr.  r 

if  PrunableType(br..bm—c)  then  let  let  r  =  false 
if  r  then  let  q  =  false 

if  a  then  remove  a,...a  — c  from  explanation 


function  PrunableTvpe(a....a„-*c) 

*  r  l  n 


if  a,. ..a  —c  is  an  abstraction  inference  then  return  true 

1  n 


if  there  is  more  than  one  inference  rule  in  the  library  whose  conclusion  unifies  with  c 

then  return  true 
else  return  false 


Figure  11.1:  GENESIS  Pruning  Procedure 


is  because  the  highest-level  explanation  does  not  include  expansions  of  schemata  that  oniy  support 
the  goal  through  the  overall  effects  of  the  schema.  For  example,  in  the  murder  example,  the 
highest-level  explanation  (see  Figure  10.6)  does  not  include  the  Atrans.  Ingest,  or  Die  actions  in  the 


expansion  of  the  Poison  schema. 


In  addition,  the  procedure  in  Figure  11.1  attempts  to  prune  certain  types  of  inferences  from 
the  highest-level  explanation  structure.  One  type  of  prunable  inference  includes  abstraction  infer¬ 
ences  based  on  GENESIS'  hierarchies  of  objects,  states,  or  actions.  For  example,  below  is  a  list  of 
the  abstraction  inferences  in  the  explanation  of  the  Claudius  and  Agrippina  example: 


Isa( Mushroom  1.  Mushroom)  —  IsafMushroom.  Food) 


Isa(Estatel.  Estate)  —  Isa(Estatel.  Valuable) 


IsafEstatel.  Valuable)  —  Isa( Estate  1 .  Inanimate) 


Isa(Person2.  Person)  —  lsa(Person2.  Character) 


Husband! Person  1.  Person2)  —  Spouse(Personl.  Person2) 


Poison(Person2,  Personl.  Mushrooml)  —  Murder('Person2.  Personl.’ 


Another  t'.  re  of  prunable  inference  includes  inferences  that  conclude  facts  that  could  be  deduced  in 
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a  variety  of  other  ways  in  general.  For  example,  below  is  a  list  of  such  inferences  in  the  murder 
example 

Spouse1  Perscnl.  Person!)  —  Spousei Person2.  Personl) 

Spouse! Person!.  Personl)  —  HeirfPersonl .  Person!) 

The  first  rule  is  in  this  class  because  the  system  has  rules  for  deducing  that  someone  is  somebody's 
spouse  other  than  by  commutativity  (e.g.  from  Husband.  Wife,  or  Married).  The  second  rule  is  in 
this  class  because  the  system  has  rules  for  deducing  that  someone  is  somebody's  heir  other  than 
from  an  instance  of  Spouse  (e.g.  from  an  instance  of  Parent).  In  order  to  prevent  the  system  from 
generating  a  schema  that  has  a  support  that  is  logically  entailed  by  another  support,  not  all  prun- 
able  inferences  are  immediately  removed  from  the  explanation.  An  inference  rule  that  is  classified 
as  prunable  is  not  pruned  if  it  is  supported  by  a  fact  that  cannot  itself  be  pruned  because  it  sup¬ 
ports  the  goai  directly  (i.e.  not  through  a  prunable  inference).  In  the  murder  example,  the  rule: 

Isa(Estatel.  Valuable)  —  Isa(Estatel.  Inanimate) 

is  not  actually  pruned  because  the  antecedent:  Isa(Estatel.  Valuable)  directly  supports  the  fact  that 
a  thematic  goal  has  been  achieved.  If  this  rule  were  pruned,  the  resulting  schema  would  have  two 
constraints  on  the  inherited  item,  one  stating  that  it  is  a  Valuable  and  one  stating  that  it  is  Inani¬ 
mate  since  both  facts  would  become  leaves  of  the  final  explanation.  However,  the  latter  fact  can  be 
easily  deduced  from  the  former,  as  it  was  in  the  original  explanation.  Retaining  prunable  rules  that 
are  supported  by  facts  that  directly  support  the  goal  prevents  such  redundant  constraints  from 
appearing  in  the  final  schema.  All  of  the  other  prunable  rules  listed  above  are  removed  since  they 
are  not  supported  by  facts  that  support  the  goal  through  a  path  that  does  not  contain  a  pruna'oie 
inference. 

Not  pruning  rules  with  antecedents  that  directly  support  the  goal  also  prevents  the  removal 
of  specialized  actions  that  are  crucial  to  the  goal  achievement.  For  example,  if  the  Poison  action 
had  an  eifect  that  was  not  inherited  from  Murder  and  that  supported  the  goal  achievement  directly, 
then  it  would  not  have  been  pruned.  Howes er.  since  all  of  its  effects  that  eventually  support  the 
goal  are  inherited  from  Murder,  the  system  does  not  consider  it  to  be  crucial  to  the  overall  plan  and 
removes  it. 

Due  to  the  representation  of  actions  and  their  abstractions,  the  pruning  algorithm  shown  in 
Figure  11.1  is  capable  of  performing  the  '.ask  of  generalizing  predicates  discussed  in  [DeJongS6b]. 
Generalizing  predicates  refers  to  the  process  of  altering  the  descriptions  of  actions  bv  changing 
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relatively  specific  pred. cates  such  as  Poison  to  more  general  predicates  such  as  Murder.  Unlike  the 
current  version,  tne  original  GENESIS  system  [MooneyS5a.  MooneySSb],  required  a  special  pro¬ 
cedure  for  this  task 

11.3.  Generalizing  the  Explanation 

After  the  initial  explanation  is  pruned,  it  is  generalized  using  the  EGGS  generalizes  The  gen¬ 
eralized  explanation  for  the  murder  example  is  shown  in  Figure  11.2.  Unfortunately,  this  generali¬ 
zation  coes  not  correspond  tc  the  intuitive  generalization  of  the  Claudius  and  Agrippina  narrative 
since  the  explanation  structure  itself  does  not  impose  the  constraint  that  the  murderer  (7x26)  and 
the  heir  (7a5)  be  the  same.  This  is  because  the  schema  still  "works-'  if  they  are  different:  however, 
the  Murder  is  left  unmotivated  in  this  case.  Since  the  resulting  plan  contains  an  unmotivated 
action  by  another  agent,  it  does  not  represent  a  volitional  action  and  is  probably  not  worth  learn¬ 
ing.  However,  the  plan  can  be  made  volitional  by  requiring  the  actor  of  everv  unmotivated 


Possess8  n  \ 
Murder  1  — >•  Deadl 


yf  -N0l° 

Inherit  1  Possess9 


-*■  ThemeGcalMetl 


ThemeGoalMe:  1 

ThemeGoaIMet(7a5.Possess(?a5.7o4)) 

7a 5  is  happy  that  7a5  has  7o4 

Isa  21 

Isa(7o4. Valuable) 

7o4  is  a  valuable. 

Possess9 

Possessv7a5.7o4) 

7a5  has  7o4. 

Inherit  1 

Inherit(7a5.?o4.?dl ) 

7a5  inherited  ?dl"s  7o4. 

Isa  15 

Isa(?o4.  Inanimate) 

7o4  is  an  inanimate  object. 

Isa20 

lsa(7dl  .Person) 

7dl  is  a  person. 

Isa  12 

Isa(  7a5  .Character ) 

7a5  is  a  character. 

Deadl 

DeadC^d 1) 

7dl  is  dead. 

Murder  1 

Murder(?x26.?dl ) 

7x26  murdered  ?dl. 

IsaS 

lsa(7x26. Character) 

7x26  is  a  character. 

PcssessS 

Possess(7dl  .7o4) 

7dl  had  ?o4. 

Heir  1 

Heir(7a5.7dl) 

7a 5  is  7dl"s  heir. 

Not6 

’Possess(7dl.7o4) 

7dl  does  not  have  7o4. 

Figure  11.2:  Murder  Example  —  Initial  Generalized  Explanation 


oliiional  action  to  be  the  same  as  the  character  whose  thematic  goal  is  achieved.  A  more  formal 
description  of  this  process  is  giver,  m  Figure  11  3.  This  procedure  insures  that  actions  that  are  cih- 
erw  ise  unmotivated  are  motivated  by  the  eventual  thematic  goal  achievement.  Any  volitional 
action  m  the  explanation  that  was  performed  by  another  actor  in  the  original  instance  is  guaranteed 
to  have  motivation  supports.  This  is  because  any  unmotivated  action  by  another  actor  would  have 
violated  the  third  learning  criterion  and  prevented  generalization.  In  the  murder  example,  the  pro¬ 
cedure  in  Figure  11.3  substitutes  °a5  for  ’’xdb  and  produces  the  final  generalization  shown  in  Fig¬ 
ure  11  4. 

It  might  be  argued  that  requiring  an  additional  step  to  fix  such  a  problem  is  somewhat  ad  hoc 
and  that  all  constraints  should  be  represented  in  the  explanation  structure  and  enforced  by  the  nor¬ 
mal  generalization  process.  A  better  solution  to  this  problem  might  be  to  explicitly  add  motiva¬ 
tions  to  actions  in  the  explanation  structure.  That  is.  if  the  ultimate  goal  is:  ThemeGoalMet(x.y) 
the  motivation:  GoaKa^v)  would  have  to  be  added  to  each  unmotivated  action,  where  a_  is  the  actor 
of  that  action  in  the  explanation  structure.  Ail  of  these  motivations  would  then  have  to  be  equated 
to  the  form:  Goal(x.v)  in  order  to  insure  that  all  of  these  actions  were  motivated  by  the  final  goal 
achievement.  The  normal  EGGS  process  would  then  enforce  the  unifications  now  enforced  by  the 
procedure  in  Figure  11.3.  Although  this  approach  is  in  some  sense  "cleaner."  it  is  functionally 
equivalent  to  the  procedure  in  Figure  11.3.  and  the  added  motivations  would  have  complicated  the 
question-answering  process. 

At  this  point,  a  brief  digression  is  warranted  regarding  the  third  learning  criterion  since  it 
interacts  in  an  interesting  wav  with  the  process  of  equating  actors  to  the  mam  character.  For 

let  m  be  the  variable  for  the  main  character  in  the  generalized  explanation 
for  each  volitional  action  a  in  the  generalized  explanation  do 
if  a  does  not  have  any  motivation  supports 

then 

let  b  be  the  actor  of  a 
if  b^m  then 
let  6  =  ! m/bi 

for  each  wff  f  in  the  generalized  explanation  do 
replace  f  with 

Figure  11.3:  Procedure  for  Equating  Actors  to  the  Main  Character 
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PossessS  \  \ 
Murderl  — >■  Deadl 


Inherit! 


Pcssess9 


ThemeGoalMetl 


TnemeGoaiMetl 

Isa2l 

Possess9 

Inherit  1 

Isa  15 

Isa20 

Isa22 

Deadl 

Murderl 

PossessS 

Heirl 

Not6 


ThemeGoalMet!  "a5  .Possess!  "a5  "o4 ) ) 
Isa!  "04.  Valuable) 

Possess! '’a5."o4) 

Inher:t("a5.'’o4.?dI ) 

Isa(°o4.  Inanimate) 
lsa("d  1. Person) 

Isa("a5  .Character) 

Dead(?dl) 

Murder(?a5."dl) 

Possessed  1  ,"o4) 

Heir(?a5.?dl) 

-Possessed  l."o4) 


'’ad  is  happy  that  °a5  nas  "o4. 
"o4  is  a  valuable. 

"a5  has  7o4. 

"a5  inherited  7dls  ?o4. 

?o4  is  an  inanimate  object. 

?dl  is  a  person. 

°ad  is  a  character. 

"dl  is  dead. 

"a5  murdered  ?dl. 

7dl  had  "o4 
"ad  is  ?dl’s  heir. 

"d  1  does  not  have  7o4. 


Figure  11.4:  Murder  Example  —  Generalized  Explanation 


example,  the  third  learning  criterion  prevents  GdaESIS  from  generalizing  the  explanation  con¬ 
structed  for  the  following  narrative. 


Bob  is  Jane's  husband  and  is  a  millionaire.  Stan  murdered  Bob  and  Jane  inherited  Si. 000.000. 


This  is  because  Jane's  inheritance  is  not  supported  by  her  own  volitional  action.  However,  the  gen¬ 
eralization  EGGS  produces  for  the  explanation  of  this  example  is  exactly  the  same  as  the  one  it  pro¬ 
duces  for  the  Claudius  and  Agrippina  narrative  (see  Figure  11.2).  If  the  process  of  equating  the 


actors  of  otherwise  unmotivated  actions  to  the  main  character  is  also  applied  to  the  generalization 


of  this  narrative,  the  murderer  is  reauired  to  be  the  same  as  the  heir  and  the  result  is  the  same  as 


the  final  generalized  explanation  for  the  Claudius  and  Agrippina  story  (see  Figure  1 1.4).  GENESIS 
could  be  altered  to  apply  the  third  learning  criterion  only  after  the  explanation  has  been  general¬ 


ized.  in  which  case  it  could  iearn  the  same  schema  from  this  storv  as  it  learns  from  the  Claudius 


and  Agrippina  one.  However,  since  in  the  general  case  this  process  would  be  equating  actors  that 
were  not  necessarily  the  same  in  the  specific  instance,  it  might  be  equating  variables  that  result  in  a 
protection  violation  '’see  section  6.2.2 A).  Consequently,  judging  the  third  learning  criterion  would 
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have  lo  be  delayed  ever,  furtner  unii!  me  packaging  process  determined  the  necessary  constraints  to 
pre\ ent  protection  violations.  If  equating  all  the  actors  of  unmotivated  actions  to  the  main  charac¬ 
ter  did  not  violate  an>  of  these  constraints,  then  it  could  be  carried  out  and  a  volitional  schema 
ceuid  be  learned. 

This  approach  could  result  in  learning  a  schema  that  is  not  a  generalization  of  the  narrative 
that  gave  rise  to  it.  For  example,  the  schema  that  would  be  acquired  from  the  narrative  about  Bob 
ar.d  Jane  would  require  that  the  murderer  and  the  heir  be  the  same  although  they  were  different  in 
the  original  story.  Since  the  goal  of  EBL  is  to  maintain  the  structure  of  the  explanation  for  the  ori¬ 
ginal  example,  this  approach  is  not  taken  and  the  third  learning  criteria  is  applied  prior  to  generali¬ 
zation.  Also,  waiting  to  apply  the  third  criterion  could  result  in  casting  work  on  pruning,  general¬ 
izing.  and  packaging  an  explanation  that  does  not  result  in  acquiring  a  schema  since  it  fails  to  meet 
the  learning  criteria. 

1 1.4.  Packaging  the  Explanation 

The  final  step  in  learning  a  schema  is  packaging  the  generalized  explanation  into  a  form  suit¬ 
able  for  the  schema  library.  The  packaging  process  used  in  GENESIS  is  a  slight  enhancement  of  the 
procedure  for  learning  partially-ordered  macro-operators  (see  section  6.2.2).  First  the  leaves  of  the 
generalized  explanation  are  divided  into  constraints,  preconditions,  and  motivations  based  on  the 
manner  in  which  they  support  actions  in  the  generalized  explanation.  For  example,  since  the  leaf: 
Heir(°a5.  ’’dl)  is  a  precondition  of  the  Inherit  action  ir.  the  generaiized  explanation  for  the  murder 
example,  it  is  mace  a  precondition  of  the  overall  schema.  Protection  violations  are  prevented  and 
the  effects  of  the  schema  are  determined  using  the  procedure  presented  in  section  6.2.2.  The  sup¬ 
ports  and  effects  for  the  schema  constructed  from  the  murder  exampie  are  shown  in  Table  11.1.  A 
name  for  the  new  schema,  such  as  Murderlnherit.  is  constructed  by  concatenating  the  names  of  the 
actions  in  the  generaiized  explanation. 


Table  11.1:  Supports  and  Effects  of  the  Murderlnherit  Schema 

.Murder  Inherits  ?a5  ,?o4  ,?d  1 ) 

Constraints 

i  Preconditions 

Effects 

IsalVif  .Character) 
Isafi’d  1  .Person) 
Isaf',o4. Valuable) 

j  Heir(?a5.',dl ) 

!  Possess(?dl  .?o4) 

j  Dead(7dlj 
j  -Possess! ?dl.?o4) 
j  Possess(',a5.?o4) 

j.v  j.’Vj.Vj.'V j,".  s  .s 


.-N-N 


"-V-j.'* 


in  d c ci i 


ne  r.o  A 


lion.  :ne  in;err.al  siruciure  oi  the  explanation  i<  stored  as  the  exp*, 
scheme:  The  actions  :n  me  explanation  form  the  subactions  of  the  neve  schema 
the  explanation  that  are  neither  supports  nor  effects  mace  up  the  internals, 
between  all  o:  the  facts  in  the  explanation  make  up  the  links  of  the  new  senem 
that  GENESIS  constructs  for  the  Murderlnherit  schema  is  shown  in  Table  11.2. 
nary  oi  the  resulting  schema  is  given  in  the  trace  for  this  example  presented  in  se 


me  other  tacts  n 
The  connections 
a.  The  expansion 
An  English  sun¬ 
ken  lo.: 


The  f.r.ai  step  in  the  packaging  of  a  new  schema  is  to  index  the  schema  so  that  the  svstem  is 
reminded  or  it  whenever  it  might  be  helpful  in  processing  a  future  narrative  Indexing  is  per¬ 
formed  by  having  the  schema  s  volitional  subactions  suggest  it.  This  is  appropriate  because  the  new 
schema  may  now  provide  an  explanation  for  why  someone  is  executing  the  subacticn  In  tne 
murder  example,  the  following  suggestion  is  added  to  the  Murder  action: 

Murder(?a5.  ?dl)  Murderlnher:t(°a5.  °o4.  "dl ) 

A  suggestion  is  not  added  to  Inherit  because  it  is  a  ncn-vclitional  action. 


11.5.  Using  the  Learned  Schema 

Once  a  schema  is  added  to  the  schema  library  and  suggestions  are  added  to  index  it.  instances 
cf  its  subactions  in  future  narratives  will  suggest  it.  If  an  instance  of  the  new  schema  is  subse¬ 
quently  determined,  it  can  result  in  GEN’ESIS  inferring  missing  actions  that  it  was  previously  inca¬ 
pable  oi  inferring.  Fcr  example,  consider  the  processing  of  the  murder  test  narrative  after  the  sys¬ 
tem  has  acquired  Murderlnherit. 


input:  Martha  is  Gene’s  mother  and  is  a  millionaire.  He  shot  her  ar.c  she  died.  Gene  got 
SI 000000. 


_ Table  11.2:  Expansion  of  Murderlnherit _ 

_ Subactions _ : _ Internals 

Murcer(?a5,°dlJ  ,  Isai?o4. Inanimate) 

Inherit('1a5.',o4.?d! ) 


(Antecedent  (Expansion  1)  (Constraint  3)) 
(Constraint  (Determine  1;  (Constraint  l)) 

'  Precondition  ( Determine  2)  ( Effect  1 )) 
Precondition  (Determine  2)  (Precondition  l)J 
'Constraint  (Determine  2)  (Constraint  2)) 
(Effect  (Determine  2';  (Effect  2),) 


Links 

(Constraint  (Determine  1)  (.Constraint  2)j 
(Effect  (Determine  1)  (Effect  1)) 

(Precondition  (Determine  2)  Precondition  2(0 
(Constraint  (Determine  2)  ( Expansion  1 ) 
(Constraint  (Determine  2)  ;  Constraint  l)' 

'  Effect  f  Determine  2)  1  Effect  3  ( i 
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Thematic  goal  achieved:  Gene  is  happy  that  Gene  has  tne  SI 000000 
Ready  for  questions: 

>  Summarize 

Gene  murdered  Martha.  Martha  had  S1000000.  Gene  was  Martha's  heir.  Gene  inherited 
Martha's  S1000000. 

>  Why  did  Gene  kill  Martha? 

So  Gene  could  inherit  Martha's  S1000000. 


>  How  did  Martha  die7 

Gene  shot  Martha  and  killed  her. 


■ 

I 

V, 


>  How  did  Gene  get  the  money? 

Gene  inherited  Martha's  SlOOOOOO. 

>  Why  did  Gene  inherit  the  money7 

Because  Gene  shot  Martha  and  killed  her.  because  Martha  had  millions  of  dollars  and  because 
Martha  was  Gene's  mother. 

>  Why  was  Gene  Martha's  heir7 
Because  Martha  was  Gene's  mother. 


-w‘ 


h 

V. 


>  Why  did  Martha  have  SlOOOOOO7 
Because  Martha  had  millions  of  dollars. 

This  time,  when  a  Murder  action  is  determined  from  the  actions  in  the  second  sentence  (see  section 
10.3).  it  suggests  the  following  instance  of  the  new  schema: 

MurderInherit(Person2.  ?o4-2.  Personl) 

in  addition,  the  Murder  action  in  the  suggested  schema  is  marked  "found"  as  are  its  following  sup- 
rorts  ar.d  effects: 


•■4 


Isav  Person!.  Character) 
Isa< Person  1.  Person) 
Dead!  Person  1) 


The  final  input:  Possess(Person2.  Money2)  matches  an  effect  of  the  Inherit  action  in  the  suggested 
Murderlnherit  schema  and  binds  °o4-2  to  Money2  since  Money 2  satisfies  the  constraint  of  being  a 
Valuable.  Since  both  of  the  positive  effects  of  the  Inherit  action  are  now  marked  as  found  (i.e. 
Deadt Person l)  and  Possess(Person2.  Mor.ey2)).  the  Murderlnherit  schema  is  determined  and  the 
missing  Inherit  action  is  added  to  the  causal  model.  The  precondition  stating  that  Martha  must 
have  possessed  SI. 000 .000  is  inferred  from  the  fact  that  she  had  millions  of  dollars  and  the  precon¬ 
dition  stating  that  Gene  is  Martha's  heir  is  inferred  from  the  fact  that  she  is  his  mother.  The  final 
explanation  that  GENESIS  constructs  for  this  example  is  shown  in  Figure  11.5.  This  explanation 
allows  the  system  to  answer  questions  that  it  could  not  answer  based  on  the  representation  it  built 
without  the  Murderlnherit  schema  (see  Figure  10.5).  The  Murderlnherit  schema  also  provides  a 
structure  for  paraphrasing  the  narrative. 

The  Murderlnherit  schema  can  also  be  used  tc  understand  narratives  missing  even  more  infor¬ 
mation.  For  example,  consider  the  following  GENESIS’  input,  output  trace: 

Input:  Mary  is  the  Bob's  mother  and  is  a  millionaire.  Someone  strangled  Mary.  Bob  got 

SI 000000. 

Thematic  goal  achieved:  Bob  :s  happy  that  Bob  has  the  S1000000. 

Ready  for  questions: 

>  Summarize 

Bob  murdered  Mary.  Mary  had  Slr'000<Xi.  3ob  was  Mary’s  heir.  Bob  inherited  Mary's 

S 1 000000 


>  He  *  did  Bob  get  the  money0 
3ob  inherited  Mary's  S1000000. 

>  'V  lie  strangled  Mary0 
Bob  strangled  Mary 


139 


Therr.eGoal.Met2 

ThemeGoalMet!  Person2. 

Possess!  Perscn2,Monev2)) 

Gene  is  happy  that  Gene  has  the  $1000000. 

IsalS 

Isal  Money  2,  Valuable) 

The  S1000000  is  a  valuable. 

Isa  14 

Isa!  Money  2. Money ) 

The  SIOOOOOO  is  money. 

Pcssesso 

Possess!  Person2  .Money  2 ) 

Gene  has  the  SlOoOOOO. 

Inheritl 

Inherit!  Person2.. Money  2, Personl) 

Gene  inherited  Martha's  S 1000000. 

IsaS 

Isa!Person2,  Character) 

Gene  is  a  character. 

Isa  2 

Isa!  Person2, Person) 

Gene  is  a  person. 

Isal 

Isa!  Person  1  .Person) 

Martha  is  a  person. 

Isa  13 

Isa(.Money2. Inanimate) 

The  SIOOOOOO  is  an  inanimate  object. 

He-.:  2 

Heir!  Person2, Person  1 ) 

Gene  is  Martha's  heir. 

Parent  1 

Paren  t!  Person  1  ,Person2 ) 

Martha  is  Gene's  parent. 

Mot  her  1 

Mother!  Person  l.Person2) 

Martha  is  Gene's  mother. 

PossessS 

Possess!  Person  1  .Money  2 ) 

Martha  had  the  SIOOOOOO. 

LessThan  1 

LessThan!  1 000000, order-millions) 

1000000  is  less  than  order-miilions. 

Isa3 

Isa!  Money  1  .Money ) 

Millions  of  dollars  is  money. 

Amount2 

A  mount!  Money 2. 1000000. dollar) 

The  SIOOOOOO  is  1000000  dollars  of  money. 

Possess  1 

Possess!  Personl.  Moneyl ) 

Martha  has  millions  of  dollars. 

Amount! 

A  mount!  .Money  1  .order-millions.doilar) 

Millions  of  dollars  is  order-millions  dollars 

of  money 

Deadl 

Dead! Personl ) 

Martha  is  dead. 

Murder2 

Murder! Person2. Personl ) 

Gene  murdered  Martha. 

ShootToKilli 

ShootToK.il  1!  Person2.  Personl  TINSTR6) 

Gene  shot  Martha  and  killed  her. 

Isa" 

IsariMSTR6.Gun) 

The  gun  is  a  gun. 

Possessj 

Possess!  Person2.?INSTR6 ) 

Gene  has  the  gun. 

Shot2 

Shot! Personl ) 

Martha  is  shot. 

N'otl 

-Possess! Personl  ..Money 2) 

Martha  does  not  have  the  SIOOOOOO. 

Figure  1 1_5:  Murder  Test  Example  After  Learning  —  Specific  Explanation 


>  Why  did  Bob  strangle  Mary? 

So  Bob  could  inherit  Mary's  $1000000. 

Since  Strangle  is  a  specialization  of  Murder,  the  input:  Strangle! '’ACTOR  1 .  Person l)  suggests  the 
scnema:  MurderIr.her;t(?ACTORl.  ,,o4-2.  Personl).  The  input  stating  that  Bob  got  a  million  dollars 


7\ 


"vm  i.' 
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matches  the  effect:  Possess'TACTORl.  °o4-2)  of  the  suggested  schema  since  Bob  is  a  character  and  a 
million  dollars  is  a  valuable  item.  This  match  leads  to  the  determination  of  MurderTnherit  and 
identifies  Bob  as  the  actor. 

However,  it  is  aiso  important  to  note  that  not  every  narrative  that  might  possibiy  fit  the  new 
schema  actually  suggests  it.  For  example,  consider  the  following  GENESIS'  input/output  trace: 

Input:  Martha  is  Gene’s  mother  and  is  a  millionaire  She  died.  Gene  inherited  S1000000. 

Thematic  goal  achieved:  Gene  is  happy  that  Gene  has  the  S1000000. 

Ready  for  questions: 

>  Summarize 

Martha  died.  Gene  inherited  Martha's  $1000000. 


n> 


v; 


>\Vho  murdered  Martha0 

Question  refers  to  unknown  event,  person,  place,  or  thing. 

>  Why  did  Martha  die0 
Cannot  find  sufficient  reason. 


>  Whv  did  Gene  inherit  the  monev°  „•} 

Because  Martha  was  Gene's  mother,  because  Martha  had  millions  of  dollars  and  because  Martha  died. 


The  system  identifies  Martha  as  the  benefactor  of  the  inheritance  since  she  is  the  one  that  is  dead 
and  has  the  money:  however.  Murderlnherit  is  never  suggested  because  the  Inherit  action  is  non- 
volitionai  and  does  not  suggest  the  new  schema. 
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CHAPTER  12 

GENESIS:  LEARNING  WORD  MEANINGS 


As  revealed  in  the  I/O  trace  presented  in  chapter  2.  GENESIS  also  has  the  ability  to  learn  pro¬ 
visional  meanings  for  words  from  one  example  of  their  use.  While  processing  the  learning  narra¬ 
tive  for  the  kidnapping  example,  the  system  encounters  the  following  unknown  words:  "kid¬ 
napper.”  "ransom."  and  "kidnap."  Based  on  the  context  in  which  these  words  appear  and  their 
relation  to  the  overall  schema  that  is  learned,  the  system  acquires  preliminary  meanings  for  each  of 
these  unknown  words.  This  chapter  describes  the  procedure  GENESIS  uses  to  learn  related  word 
meanings  when  it  is  acquiring  a  new  schema.1  Section  12.1  attempts  to  motivate  the  approach  taken 
in  GENESIS  by  revealing  a  problem  with  previous  approaches.  Next,  since  the  procedure  for  learn¬ 
ing  role  labels  such  as  "kidnapper"  and  "ransom"  is  significantly  different  from  that  used  to  learn 
scheme  labels  such  as  "kidnap."  a  separate  section  is  dedicated  to  each  of  these  processes. 

12.1.  A  Problem  with  Previous  Models  of  Learning  Word  Meanings 

Previous  computational  models  of  the  acquisition  of  word  meaning  [BerwickSS.  Granger77. 
Self ridg«S2]  have  assumed  existing  knowledge  of  the  concept-underlving  the  word  to  be  learned.  In 
these  models,  word  learning  is  a  process  of  using  surrounding  context  to  establish  an  identification 
between  a  new  lexical  item  and  a  known  concept.  However,  new  words  are  not  always  encoun¬ 
tered  as  labels  for  known  concepts.  When  encountering  a  new  concept  in  natural  language  text  or 
discourse,  it  is  quite  likely  that  one  will  also  come  across  unknown  words  that  refer  to  various 
aspects  of  the  new  concept.  A  word  learning  model  that  requires  prior  knowledge  of  the  underly¬ 
ing  concept  will  be  unable  to  acquire  even  provisional  meanings  for  such  words.  For  example,  such 
an  approach  could  not  iearn  meanings  for  the  unknown  words  in  the  learning  narrative  for  the  kid¬ 
nap  example  since  when  the  system  first  encounters  these  words,  it  does  not  have  any  knowledge  of 
the  concepts  underlying  them. 

Ir.  addition,  developmental  studies  in  psychology  suggest  that  the  learning  of  words  and  their 
underlying  concepts  frequently  occurs  concurrently  in  children.  Experiments  by  Gopnik  and 
Meitrcff  revealed  that  children's  acquisition  of  "disappearance"  words  occurred  at  about  the  same 
time  they  learned  to  soive  object-permanence  tasks  involving  invisible  displacements  [Gopmkf>6j. 


1  a  neseripticr.  s:  GENESIS'  verd  lea 


:r.g  abilities  .5  also  presented  ir.  '  Micr.eys'a  ,. 
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From  this  data,  they  conduced  that  learning  may  often  involve  "concurrent  cognitive  and  semantic 
developments,  rather  than  involving  cognitive  prerequisites  for  semantic  developments"  fp.  1051). 
They  also  state.  "Tins  raises  the  interesting  possibility  that  conceptual  and  semantic  development 
may  occur  concurrently,  with  each  area  of  development  influencing  and  facilitating  the  other. (p. 
1051)"  Bowerman  [BowermanSO]  and  Kuczaj  [KuczajS2]  have  also  used  developmental  data  to 
argue  for  an  interactive  approach  to  language  and  concept  acquisition. 

In  GENESIS,  word  learning  is  integrated  with  schema  acquisition.  This  integration  allows  the 
v.  stem  to  simultaneously  learn  both  a  concept  and  a  word  that  refers  to  the  concept.  Conse¬ 
quently,  GENESIS  can  acquire  definitions  for  words  that  cannot  be  learned  by  a  system  that 
requires  prior  conceptual  knowledge.  In  addition,  the  system  exhibits  behavior  similar  to  that 
observed  in  children's  acquisition  of  word  meanings. 

12.2.  Learning  Role  Labels 

Role  labels  are  words  that  refer  to  the  role  that  an  object  or  character  plays  in  a  schema. 
Examples  of  role  labels  include  "kidnapper."  "ransom."  and  "victim."  The  procedure  GENESIS 
uses  to  learn  role  labels  is  similar  to  the  technique  used  by  the  FOUL-UP  system  [Granger 77] 
except  that  it  is  integrated  with  the  schema  learning  and  schema  suggestion  processes.  When  the 
parser  encounters  an  unknown  word  where  it  expects  a  noun  for  an  object  of  a  particular  class,  a 
dummy  variable  is  created,  annotated  with  the  unknown  word,  and  allowed  to  fill  the  expectation. 
For  example,  the  phrase  "Fred  paid  him  the  ransom"  in  the  kidnapping  narrative  is  parsed  into  the 
assertion:  Atranst Person".  ?PhysicalObiectl .  Person9.  ?AT24)  where  the  variable  '’PhysicalObjectl 
is  marked  with  the  fact  that  it  came  from  the  unknown  word  "ransom."  If  an  input  with  an 
unknown-word  variable  like  '’PhysicalObjectl  matches  a  form  in  a  suggested  schema,  then  a  provi¬ 
sional  definition  for  the  word  is  made  based  on  the  constraints  on  the  schema  variable  that  matches 
the  unknown-word  variable.  For  example,  in  the  case  of  the  unknown  word  "ransom."  the  previ¬ 
ous  sentence  in  the  story  suggests  a  Bargain  schema  between  John  and  Fred.  The  subactions  of  this 
suggested  schema  are: 

1)  John  told  Fred  that  if  Fred  gave  John  the  $250000  at  Trenos  then  John  wouid  release  Mary. 

2)  Fred  gave  John  the  S250000  at  Trenos. 

3;  John  released  Mary. 

Since  "Fred  paid  him  the  ransom"  matches  the  suggested  subaction  "Fred  gave  John  the  $250000  at 
Trer.os"  and  since  "ransom"  fills  the  role  of  the  item  whose  possession  is  transferred,  an  initial 
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definition  is  mace  for  ransom''  staling  that  :  is  a  physical  object  cv  hose  possession  is  transferred 
curing  a  Bargain 

However,  this  is  not  the  final  definition  created  for  “ransom"  since  an  additional  process  is 
performed  when  a  new  schema  is  learned.  Each  of  the  subactions  composing  a  new  schema  is 
checked  for  roles  that  are  filled  by  unknown-word  variables  or  that  were  previously  matched  to 
such  a  variable  resulting  in  an  initial  definition.  In  either  c2se.  a  new  definition  is  created  for  the 
unknown  word  basec  on  the  role  it  fills  in  the  learned  schema  and  the  schema  constraints  on  this 
roie.  Consequently,  when  the  CaptureBargam  schema  is  subsequently  learned  from  the  kidnapping 
narrative,  it  causes  “ransom"  to  be  redefined  as  a  valuable  item  whose  possession  is  transferred  to 
the  actor  m  a  CaptureBargam  schema.  In  other  words,  the  word  "ransom"  is  tentatively  associated 
•with  the  schema  variable  ?yj  in  CaptureBargam.  whose  English  paraphrase  is  repeated  below  for 
convenience: 

"^9  is  a  person.  °c4  is  a  location.  ?r5  is  a  room.  ?c4  is  in  ?r 5.  ?x55  is  a  character.  ?b9  is  free. 
I’xffi  captures  7b9  and  locks  him/her  in  ?r5.  ?a34  is  a  character.  7x55  contacts  7a34  and  tells  it 
that  °b9  is  ?x55's  captive.  ?y5  is  a  valuable.  'I x55  wants  to  have  ?y5  more  than  it  wants  ?b9 
to  be  °xfi5's  captive.  ?a34  has  a  positive  relationship  wfitb  ?b9.  ?a34  has  ?y5.  ^x55  and  ?a34 
carry  out  a  bargain  in  which  7x55  releases  7b9  and  ?a34  gives  7x55  ?y 5  at  7111. 

The  rationale  for  having  learned  schemata  take  precedence  when  defining  new  role  labels  is  that  a 
learned  schema  represents  a  new  situation  and  therefore  new  words  are  deemed  more  likeiv  to  be 
directly  associated  with  it  than  with  an  existing  schema  like  Bargain. 

The  provisional  definition  added  to  the  iexicon  for  a  role  label  contains  two  parts.  The  first  is 
a  set  of  constraints  on  the  object  itself,  such  as  the  constraint  that  "ransom"  refers  to  a  Valuable. 
The  second  part  is  a  suggestion  of  the  schema  of  which  it  is  a  role.  The  fact  that  a  roie  label 
definition  can  suggest  a  relevant  schema  allows  GENESIS  to  use  the  definition  to  construct  explana¬ 
tions  for  narratives  that  it  otherwise  would  not  understand.  For  example,  the  definition  acquired 
for  the  word  "ransom"  is  crucial  to  the  system's  understanding  of  the  following  narrative. 

Input.  Ted  is  Alice's  husband.  John  took  Alice  into  a  room.  Ted  paid  John  the  ransom  and  iohn 
released  Alice. 

Thematic  goal  achieved:  John  is  happy  that  John  has  the  ransom. 


U.C  JT-Ml 


Keahv  i or  Questions. 


>  Summarize 


Alice  was  free.  John  captured  Alice  and  locked  her  in  a  room.  John  contacted  Ted  and  told 
him  that  Alice  was  John's  captive.  John  wanted  to  have  a  ransom  more  than  he  wanted  Alice 
to  be  John's  captive.  Ted  had  a  positive  relationship  with  Alice.  Ted  had  the  ransom  John 
and  Ted  carried  out  a  bargain  in  which  John  released  Alice  and  Ted  gave  John  the  ransom. 


>  Why  did  John  take  Alice  into  the  room? 

Sc  John  could  confine  Alice  in  the  room. 

>  Why  did  Ted  pay  John-7 

Because  Ted  believed  that  if  Ted  gave  John  the  ransom  then  John  would  release  Alice  and 
because  Ted  wanted  Alice  to  be  free  more  than  he  wanted  to  have  the  ransom. 

When  the  word  "ransom"  ;s  encountered  in  this  narrative,  it  suggests  that  the  CaptureBargain 
schema  might  be  relevant.  This  schema  is  then  used  in  a  top-down  fashion  to  construct  an  explana¬ 
tion  for  the  text.  Since  no  other  piece  of  information  suggests  CaptureBargain.  the  learned 
definition  for  "ransom"  is  crucial  in  understanding  this  story.  If  the  definition  for  this  word  is 
removed,  the  system  is  no  longer  acie  to  understand  the  narrative  as  illustrated  by  the  following 


innut:  Ted  is  Alice's  husband.  John  took  Alice  into  a  room.  Ted  paid  John  the  ransom  and  John 
released  Alice. 


Ready  for  questions: 


>  Summarize 

John  moved  Aiice  into  a  room  Ted  gave  John  a  ransom.  John  released  Aiice. 


>  Why  did  John  take  Alice  into  the  room0 
Cannot  find  sufficient  reason. 


>  Whv  c id  Ted  rav  Jonn" 


Cannot  find  sufficient  reason. 

A  definition  that  associates  the  word  "kidnapper''  with  the  mam  character  of  a  CaptureBar- 
gain  schema  (?x55)  is  learned  in  a  manner  directly  analogous  to  the  learning  of  the  word  "ransom." 
This  word  is  also  made  to  suggest  the  new  schema,  and  this  suggestion  allows  GENESIS  to  under¬ 
stand  the  foilow-ing  narrative: 

Input:  Ted  is  Alice's  husband.  A  kidnapper  took  Alice  into  a  room.  Boo  got  75000  dollars  and 
released  Alice. 

Thematic  goal  achieved:  Bob  is  happy  that  Bob  has  the  $75000. 

Ready  for  questions: 

>  Summarize 

Alice  was  free.  Bob  captured  Alice  and  locked  her  in  a  room.  Bob  contacted  Ted  and  told  him 
that  Alice  was  Bob's  captive.  Bob  wanted  to  have  $75000  more  than  he  wanted  Alice  to  be 
Bob's  captive.  Ted  had  a  positive  relationship  with  Alice.  Ted  had  the  S75000.  Bob  and  Ted 
carried  out  a  bargain  in  which  Bob  released  Alice  and  Ted  gave  Bob  the  S75000. 

>  Who  took  Alice  into  the  room'’ 

Bob  moved  Alice  into  the  room. 

>  Why  did  Bob  take  Alice  into  the  room? 

So  Bob  could  confine  Alice  in  the  room. 

>  How  did  Bob  get  the  money? 

Bob  kidnapped  Alice. 

If  the  definition  for  "kidnapper"  is  removed,  the  following  trace  reveals  the  result: 

Input:  Ted  is  Alice's  husband.  A  kidnapper  took  Alice  into  a  room.  Bob  got  75000  dollars  and 
released  Alice. 
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Ready  for  questions: 

>  Summarize 

A  kidnapper  moced  Alice  into  a  room.  Bob  released  Alice. 

>  Who  took  Alice  into  the  room'’ 

The  kidnapper  moved  Alice  into  the  room. 

>  Why  did  Bob  take  Alice  into  the  room? 

Question  refers  to  unknown  event,  person,  place,  or  thing. 
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>  How  did  Bob  get  the  money? 

Cannot  find  sufficient  cause. 

Although  learning  definitions  for  words  such  as  "kidnapper"  and  "ransom."  can  be  very  help¬ 
ful.  many  times  a  new  word  that  fills  a  slot  in  a  schema  will  not  actually  be  a  role  label.  For  exam- 
pie.  consider  replacing  the  word  "ransom"  in  the  kidnap  learning  narrative  with  the  word  "moo¬ 
lah."  Since  the  word  "moolah"  is  unknown.  GENESIS  gives  it  a  definition  identical  to  the  one  :t 
learns  for  "ransom."  In  order  to  be  able  to  recover  from  such  mistakes,  the  system  monitors  the 
schemata  suggested  by  newly  learned  words.  If  a  new  word  subsequently  suggests  a  schema  that 
does  not  explain  any  future  inputs,  the  suggestion  is  removed.  For  example,  consider  processing  the 
following  murder-for-mhernance  story  after  the  system  has  acquired  a  "ransom  definition",  for 
"moolah." 

Mary  had  SlOOOOO.  Stan  murdered  Mary  and  inherited  the  moolah. 

The  word  "moolah"  suggests  a  CaptureBargain  schema:  however,  none  of  the  actions  in  the  narra¬ 
tive  match  the  subactions  in  its  expansion.  Consequently,  the  suggestion  is  considered  to  be 
misleading  and  is  removed  from  the  learned  word. 

12.3.  Learning  Schema  Labels 
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Schema  Icoeis  are  verbs  that  refer  to  an  entire  pian  schema.  Examples  of  schema  labels  induce 
"kidnap,"  "rob."  and  "poison."  Learning  meanings  for  schema  labels  is  a  more  difficult  task  since 
the  relevant  context  is  potentially  much  broader.  A  sentence  such  as  "John  robbed  the  store"  may¬ 
be  used  to  introduce  a  long  piece  of  text  elaborating  the  s.tuation.  to  succinctly  summarize  a 
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pre\  ious  piece  ot  :exi.  or  ;o  re:er  to  a  single  action  in  an  even  larger  plan.  A  fevc  heuristics  have 
been  de'-eioped  that  allow  a  reasonable  guess  to  be  made  regarding  the  referent  of  such  unknown 
•verbs.  The  following  one  is  used  to  resolve  the  meaning  of  "kidnap"  as  used  in  the  kidnap  learnmp 
narrativ  e. 

If  one  character  informs  another  that  some  unknown  action  occurred  and  a  schema  whose  actor 
is  the  same  as  this  action's  was  recently  acquired  from  the  narrative,  and  this  schema  also  has 
roles  hlied  by  the  speaker  and  any  direct  and  indirect  objects  of  the  action,  then  assume  that 
the  speaker  is  summarizing  the  event  and  that  the  unknown  act  refers  to  the  new  schema. 

Specifically,  since  Fred  tells  his  wife  that  "someone  kidnapped  Mary"  and  since  both  he  and  Mary 
were  participants  in  the  jus;  completed  CaptureBargain  schema.  GENESIS  assumes  that  the  word 
"kidnap"  refers  to  CaptureBargain.  A  definition  for  "kidnap"  is  added  to  the  lexicon  where  it  can 
be  used  in  the  parsing  of  future  sentences.  This  definition  states  that  the  word  refers  to  an  instance 
of  CaptureBargain  in  which  the  subject  of  the  clause  is  the  actor  (?x55)  and  the  direct  object  is  the 
victim  (?b9).  Of  course,  the  word  "kidnap"  is  frequently  used  to  refer  to  only  an  act  of  abduction 
rather  than  to  an  entire  kidnap-for-ransom  schema:  however,  the  definition  that  GEXESIS  learns  is 
a  reasonable  one  given  its  current  experience  with  the  new  word.  In  addition,  a  data-structure  Is 
added  to  the  vocabulary  of  the  generator  where  it  can  be  used  in  the  production  of  English  replies 
to  users'  questions.  For  "kidnap",  the  added  data-structure  states  that  an  instance  of  CaptureBar¬ 
gain  can  be  stated  in  English  by  saying:  "7x55  kidnapped  ?b9." 

The  new  definition  of  "kidnap"  in  the  lexicon  is  cruciai  to  GENESIS'  understanding  of  the  fol¬ 
lowing  narrative: 

Input:  Steve  kidnapped  Valerie.  Mike  was  Valerie's  father  and  paid  Steve  $30000. 

Thematic  goal  achieved:  Steve  is  happy  that  Steve  has  the  valuable. 

Ready  for  questions: 

>  Summarize 

V  aiene  was  free.  Steve  captured  Valerie  and  locked  her  in  a  room.  Steve  contacted  Mike  and 
told  him  that  Valerie  was  Steve's  captive.  Steve  wanted  to  have  S30000  mere  than  he  wanted 
Valerie  to  be  Steve's  captive.  Mike  had  a  positive  relationship  with  Valerie.  Mike  had  the 
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S 30000.  Siev  e  ana  Mike  earned  out  a  bargain  in  which  Steve  releasee  Yaierie  and  Mike  gave 
Steve  the  SSOoOO. 

>  Why  did  Mike  pay  Steve  the  money0  ' 

Because  Mike  believed  that  if  Mike  gave  Steve  the  S30000  then  Steve  would  release  Valerie  and 
because  Mike  wanted  Valerie  to  be  free  more  than  he  wanted  to  have  the  S30000. 

>  Why  did  Steve  kidnap  Valerie0 

Because  Steve  wanted  to  have  the  S30000  more  than  he  wanted  Valerie  to  be  Steve's  captive. 

In  this  narrative.  "Steve  kidnapped  Valerie"  is  interpreted  as  describing  an  instance  of  CaptureBar- 
gain  in  which  Steve  is  the  actor  and  Valerie  is  the  victim.  The  assertion  that  "Mike  paid  Steve 
S30000"  matches  a  part  of  the  expansion  of  this  instance  of  CaptureBargain.  and  the  input  "Mike  is 
Valerie's  father"  implies  the  precondition  that  Mike  values  her  freedom  more  than  material  posses¬ 
sions.  If  the  definition  of  "kidnap"  is  removed  from  the  lexicon,  the  narrative  no  longer  directly 
references  CaptureBargain  nor  even  suggests  it.  and  GENESIS  can  no  longer  understand  the  story  as 
illustrated  by  the  following  trace: 

Input:  Steve  kidnapped  Valerie.  Mike  was  Valerie's  father  and  paid  Steve  S30000. 


Thematic  goal  achieved:  Steve  is  happy  that  Steve  has  the  S30000. 
Ready  for  questions: 

>  Summarize 

Steve  did  something  to  Valerie.  Mike  gave  Steve  SSOOOO. 

>  Why  did  Steve  kidnap  Valerie? 

Cannot  find  sufficient  reason. 
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>  Why  did  Mike  pay  Steve  the  money-1 
Cannot  find  sufficient  reason. 
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The  same  technique  used  to  learn  a  definition  for  "kidnap"  has  also  been  used  to  acquire  a 
preliminary  definition  for  the  schema  label  "entrap"  The  details  of  this  example  are  presented  m 
the  discussion  of  the  acquisition  of  a  solicitation-for-entrapment  schema  presented  in  appendix  B. 


12.4.  Comments  on  the  Integrated  Learning  of  Words  and  Concepts 

Procedures  in  GENESIS  for  the  integrated  learning  of  words  and  their  underlying  concepts 
represent  only  a  preliminary  exploration  of  the  potential  interaction  between  language  and  concept 
acquisition.  As  such,  there  are  many  refinements  that  could  be  made  and  many  problems  that  need 
to  be  addressed.  Below  is  a  short  list  of  some  of  the  obvious  areas  for  future  research. 


(!)  The  procedure  for  removing  schema  suggestions  from  new  definitions  is  too  strict.  One 
counter-example  should  not  eliminate  a  suggestion  and  repeated  usefulness  of  a  suggestion 
should  make  it  resistant  to  elimination. 


12)  Morphology  of  unknown  words  should  be  considered.  A  "kidnapper"  is  clearly  the  actor  of  a 
"kidnapping." 

>  3)  More  and  better  heuristics  are  needed  for  determining  whether  a  word  might  be  a  schema  Label 
and  to  what  schema  it  might  refer. 

,4)  Only  role  labels  and  schema  labels  are  considered.  Many  words  do  not  fall  into  either  of  these 
two  categories. 

;fi'  Only  integration  with  explanation-based  learning  has  been  considered.  Integration  with 
similarity-based  learning  should  also  be  examined. 


S _ _  _ _ _ _ _ _ _ 
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CHAPTER  13 

GENESIS:  LEARNED  SCHEMATA  AS  RETRIEVAL  INDICES 


Conceptual  Information  retrieval  involves  indexing  and  retrieving  textual  information  based 
on  an  interpretation  of  its  "meaning.”  As  discussed  in  [SchankSla],  this  approach  has  a  number  of 
dvantages  over  standard  information  retrieval  systems  based  on  keywords.  A  conceptual  infor¬ 
mation  system  Fke  the  CyFr  system  presented  in  [SchankSla]  indexes  specific  textual  passages 
under  the  schemata  that  an  understanding  system  used  in  processing  the  text.  These  schemata  can 
hen  act  as  indices  for  the  retrieval  of  information  by  a  question  answering  system.  In  CyFr. 
FRUMP  [DeJongS2b]  was  used  to  process  news  stories  and  CYRUS  [KolodnerS4]  was  used  to  index 
_r.c  retrieve  episodes  for  the  purpose  of  answering  questions. 

CYRUS  was  also  capable  of  learning  specializations  of  existing  schemata  and  using  these  new 
chemata  to  index  and  retrieve  specific  events.  For  example.  CYRUS  started  with  a  general  schema 
or  diplomatic  meetings  and  then  learned  a  specialization  in  which  military  aid  was  the  topic  of 
discussion.  Specific  episodes  involving  meetings  about  military  aid  were  then  indexed  under  this 
iew  schema,  and  this  indexing  was  used  to  retrieve  answers  to  questions  such  as:  "Who  has  Vance 
alked  to  about  military  aid°"  The  ability  to  learn  new  wavs  of  indexing  episodes  is  important  in 
maintaining  an  efficient  and  useful  dynamic  memory  [SchankS2].  It  is  crucial  for  maintaining 
organization  in  very  large  databases  of  information.  Learning  new  indices  also  allows  a  system  to 
totice  similarities  among  texts  unforeseen  by  the  system's  implementors  and  use  these  similarities 
o  neip  retrieve  related  information  [Moonev87b]. 


Like  CyFr.  GENESIS  indexes  specific  episodes  under  schemata  that  were  used  in  understanding 
he  episode.  In  addition,  since  GENESIS  can  learn  new  schemata,  the  system  is  also  capable  of  learn- 
r.g  new  ways  of  indexing  episodes.  However.  GENESIS  uses  explanation-based  methods  to  learn 
chemata  for  novel  combinations  of  existing  schemata,  unlike  CYRUS,  which  used  empirical 
methods  to  iearn  specializations.  A  number  of  case-based  reasoning  systems  also  learn  new  sche¬ 
mata  for  indexing  episodes  in  memory  iBainS6.  KolodnerST],  However,  like  CYRUS,  these  systems 
reate  schemata  by  noticing  similarities  between  specific  cases  that  were  stored  in  the  same  place  in 
memory.  GENESIS,  on  the  other  hand,  creates  a  schema  from  a  single  instance  and  actually  uses 
his  schema  to  notice  the  similarity  between  two  instances  and  therefore  store  them  in  the  same 
Face  m  memory.  Therefore,  the  learning  of  memory  indices  in  GENESIS  is  very  different  from 
hat  performed  by  systems  using  case-based  reasoning.  The  remainder  of  this  chapter  describes  the 
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simple  indexing  sene  me  used  in  GENESIS  and  gives  examples  of  how  n  can  be  used  to  retrieve 
related  episodes.1 

When  GENESIS  has  finished  processing  a  narrative,  the  indexer  examines  the  highest-level 
explanation  for  each  of  the  thematic  goals  achieved  in  the  narrative.  It  then  stores  the  complete 
causa!  model  created  for  this  specific  narrative  in  the  long-term  store,  indexing  it  under  each  of  the 
schemata  in  these  highest-level  explanations.  An  exception  to  this  rule  is  if  the  system  acquires  a 
nev.  schema  from  an  explanation,  in  which  case  the  episode  is  indexed  under  the  new  schema 
instead  of  under  the  schemata  that  compose  it.  When  answering  questions  about  a  particular  nar¬ 
rative.  the  system  can  be  told  to  "Review  similar  stories."  This  causes  the  system  to  retrieve  past 
episodes  that  are  indexed  under  the  same  schemata  used  to  index  the  present  story  and  make  them 
available  for  question  answering  and  paraphrasing.  Reviewing  is  done  by  temporarily  replacing  the 
causal  model  of  the  current  text  with  the  causal  model  previously  constructed  and  saved  for  the 
episode  being  reviewed.  For  example,  consider  the  state  of  the  system  after  it  has  processed  both 
the  learning  and  test  narratives  for  both  the  kidnap  and  murder  examples.  The  following  trace 
shows  the  behavior  of  the  system  when  processing  a  third  murder-for-inheritance  story: 

Input:  Mary  is  the  Bob's  mother  and  is  a  millionaire.  Someone  strangled  Mary.  Bob  got 

S 1000000. 


"hematic  goal  achieved:  Bob  is  happy  that  Bob  has  the  S1000000. 


heacv  : or  ouestions: 


>  Summarize 

Bob  murdered  Mary.  Mary  had  S1000000.  Bcb  was  Mary's  heir.  Bob  inherited  Mary's 

S 1 000000. 


>How  did  Bob  murder  Mary? 
Bob  strangled  Mary. 


>  Why  is  Bob  Mary's  heir1 

■A  description  o:'  GENESIS'  ability  '.o  use  learned  scderr.a'.a  to  index  episodes  is  also  presentee  ;n  [.Mooney So;. 
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Because  Marv  was  3ob's  mother. 


>  Review  similar  stones 


There  are  2  other  instances  of  this  schema.  Enter  number  of  story  to  be  reviewed  >  1 
Reviewing  MurderlnheritStoryl 

Ready  for  questions: 


>  Summarize 

Agrippina  murdered  Claudius.  Claudius  had  an  estate.  Agrippina  was  Claudius's  heir.  Agrip- 
rma  inherited  Claudius’s  estate. 


>  How  did  Agrippina  murder  Claudius? 
Agrippina  poisoned  Claudius  with  the  mushroom. 

>  Why  is  Agrippina  Claudius's  heir? 

Because  Claudius  was  Agrippina’s  husband. 


1  here  are  2  other  instances  of  this  schema.  Enter  number  of  story  to  be  reviewed  >  2 
Reviewing  Murder InheritStorv2 


Readv  for  Questions: 


>  Summarize 

Gene  murdered  Martha.  Martha  had  $1000000.  Gene  was  Martha’s  heir.  Gene  inherited 
Martha's  $1000000. 


>  How  did  Gene  murder  Martha? 
Gene  shot  Martha  and  killed  her. 


There  are  2  other  instances  of  this  schema.  Enter  number  of  storv  to  be  reviewed  > 
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Rec  :ew  finished. 

>  Review  kidnapping  stories 

There  are  2  instances  of  CaptureBargain.  Enter  number  of  story  to  be  reviewed  >  1 
Review  ing  CaptureEargainStorv  1 

Ready  for  questions: 

>  Summarize 

Mary  was  free.  John  captured  Mary  and  locked  her  in  a  room.  John  contacted  Fred  and  told 
him  that  Mary  was  John's  captive.  John  wanted  to  have  S250000  more  than  he  wanted  Mary 
to  be  John's  captive.  Fred  had  a  positive  relationship  with  Mary.  Fred  had  the  $250000.  John 
and  Fred  carried  out  a  bargain  in  which  John  released  Mary  and  Fred  gave  John  the  $250000  at 
Trenos. 


\ 


V 
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>How  did  John  communicate  to  Fred? 

John  called  Fred  and  told  him  that  Mary  was  John's  captive. 

>Why  did  Fred  want  Mary  free1 
3ecause  Fred  was  Mary's  father. 


> 

There  are  2  instances  of  CaptureBargain.  Enter  number  of  story  to  be  reviewed  > 

Review  finished. 

An  additional  feature  that  is  illustrated  in  this  trace  is  GENESIS’  ability  to  use  learned  words  in 
the  retrieval  of  past  episodes.  For  example,  given  the  definition  acquired  for  "kidnap”  and  some 
know  ledge  of  English  morphology,  the  system  interprets:  ''Review  kidnapping  stories"  as  a  com¬ 
mand  to  retrieve  past  instances  of  the  CaptureBargain  schema. 


CHAPTER  14 


THE  PSYCHOLOGICAL  STATUS  OF  EXPLANATION-BASED  LEARNING 

Most  theories  of  concept  and  schema  acquisition  in  cognitive  psychology  have  been 
similarity-based  and  postulated  learning  mechanisms  based  on  inductive  generalization  across 
numerous  examples  (e.g.  [Anderson79.  MedinS7b,  Posner68.  Rumelhart78.  Thorndvke79]).  How¬ 
ever.  several  recent  articles  [\ledinS7a.  MurphySS.  Schank86b]  have  criticized  this  approach  for  its 
failure  to  acknowledge  the  importance  of  subjects'  existing  knowledge  and  its  inability  to  explain 
subjects'  preference  for  correlations  that  have  causal  explanations. 

This  raises  the  interesting  question  of  whether  an  explanation-based  learning  mechanism  can 
be  productively  interpreted  as  a  cognitive  model  of  certain  types  of  human  learning.  Although  a 
number  of  psychological  experiments  demonstrate  people's  ability  to  learn  concepts  or  schemata 
from  two  examples  using  analogy  [Gick83,  Spencer86]  or  from  many  examples  using  similarity- 
based  induction  [MedinS7b.  Posr.er6S],  until  very  recently,  there  were  no  experiments  that  demon¬ 
strated  people's  ability  to  learn  a  concept  or  schema  by  explaining  a  single  example.  This  chapter 
reviews  some  recent  experiments  that  indicate  that  people,  like  GENESIS,  can  learn  a  plan  schema 
by  explaining  and  generalizing  a  single  specific  instance  presented  in  a  narrative.  These  experiments 
are  more  completely  reviewed  in  [AhnS7a]  and  details  on  the  experimental  methodology  are  given 
in  [AhnSTbj. 

14.1.  Overview  of  the  Experiments 

The  overall  design  of  the  experiments  involved  subjects  reading  a  single  narrative  describing  a 
specific  instance  of  a  novel  plan  and  performing  a  variety  of  tests  constructed  to  determine  whether 
or  not  they  had  acquired  an  abstract  schema  from  this  single  example.  Three  passages  were  con¬ 
structed  to  present  situations  for  which  the  subjects  presumably  did  not  already  have  a  pre- 
established  schema  but  which  they  could  understand  using  pian-b3sed  understanding.  For  example, 
one  passage  involves  a  cooperative  buying  scheme  used  in  other  countries.  In  Korea  the  system  is 
called  a  "Kyeah"  and  in  India  it  is  called  a  "chit  fund".  The  experimental  narrative  describing  a 
single  instance  of  this  plan  follows: 

Tom.  Sue.  Jane,  and  Joe  were  all  friends  and  each  wanted  to  make  a  large  purchase  as  soon  as 
possible.  Tom  wanted  a  VCR.  Sue  wanted  a  microwave.  Joe  wanted  a  car  stereo,  and  Jane 
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warned  a  compact  disk  player.  However,  they  each  only  had  S50  left  at  the  end  of  each  month 
after  paying  their  expenses.  Tom.  Sue.  Jane,  and  Joe  all  got  together  to  solve  the  problem. 
They  made  four  slips  of  paper  wmh  the  numbers  1.2.3.  and  4  written  on  them.  They  put  them 
m  a  hat  and  each  drew  out  one  slip  Jane  got  the  slip  with  the  4  written  on  it.  and  said.  "Oh 
darn.  I  have  to  wait  to  get  my  CD  player."  Joe  got  the  slip  with  the  1  written  on  it  and  said. 
"Great.  I  can  get  my  car  stereo  right  away!"  Sue  got  the  number  2.  and  Tom  got  number  3.  In 
January,  they  each  contributed  the  S50  they  had  left.  Joe  took,  the  whole  S2C0  and  bought  a 
Pioneer  car  stereo  at  Service  Merchandise.  In  February,  they  each  contributed  their  S50  again. 
This  time.  Sue  used  the  S200  to  buy  a  Sharp  600  watt  1.5  cubic  foot  microwave  at  Service 
Merchandise.  In  March,  ail  four  again  contributed  $5 0.  Tom  took  the  money  and  bought  a 
Sanyo  Beta  VCR  with  wired  remote  at  Service  Merchandise.  In  April.  Jane  got  the  S200  and 
bought  a  Technics  CD  player  at  Service  Merchandise. 

In  addition  to  a  group  given  specific  narratives  (the  instance  group),  some  experiments  also  used  a 
control  group  that  was  given  abstract  descriptions  of  the  schemata  underlying  each  of  the  example 
narratives  (abstract  group).  The  description  of  the  Kyeah  schema  given  to  the  abstract  group  fol¬ 
low's: 


Suppose  there  are  a  number  of  people  (let  the  number  be  r.)  each  of  whom  wants  to  make  a 
large  purchase  but  does  not  have  enough  cash  on  hand.  They  can  cooperate  to  solve  this  prob¬ 
lem  by  each  donating  an  equal  small  amount  of  money  to  a  common  fund  on  a  regular  basis. 
(Let  the  amount  donated  by  each  member  be  m.)  They  meet  at  regular  intervals  to  collect 
everyone's  money.  Each  time  money  is  collected,  one  member  of  the  group  is  given  all  the 
money  collected  (n  X  m)  and  then  with  that  money  he  or  she  can  purchase  what  he  or  she 
wants.  In  order  to  be  fair,  the  order  in  which  people  are  given  the  money  is  determined  ran¬ 
domly.  The  first  person  in  the  random  ordering  is  therefore  able  to  purchase  their  desired  item 
immediately  instead  of  having  to  w-ait  until  they  save  the  needed  amount  of  money.  Although 
the  iast  person  does  not  get  to  buy  their  item  early,  this  individual  is  no  worse  off  than  they 
would  have  been  if  they  waited  until  they  saved  the  money  by  themselves. 

Since  subjects  in  the  abstract  group  had  beer,  directly  told  the  content  of  the  schema,  they  were 
presumed  to  have  learned  the  schema.  Consequently,  if  the  instance  group  performed  as  well  as 
the  abstract  group  or.  a  task  requiring  knowledge  of  the  general  schema,  then  it  is  reasonable  to 
assume  that  the  subjects  :r.  the  instance  group  had  also  acquired  the  schema. 
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Before  any  of  the  experiments  were  conducted,  the  experimenters  analyzed  the  instance  pas¬ 


sages  from  an  E3L  perspective  and  determined  a  set  of  variables  and  constraints  characterizing  the 


schema  that  could  be  learned  from  generalizing  an  explanation  of  the  narrative.  A  constrain:  in 


this  context  is  defined  as  a  property  that  supports  the  explanation  for  how  the  thematic  goals  in  the 


narrative  were  achieved.  For  example,  the  variables  and  constraints  for  the  Kveah  passage  are 
shown  in  Table  14.1.  In  order  to  determine  whether  a  learned  schema  agreed  with  that  predicted 
by  EBL.  subjects'  learning  in  each  task  was  judged  based  on  how  well  they  obeyed  the  constraints 
and  recognized  the  mutability  of  the  variables. 


14.2.  Experiment!:  Abstract  Description  Generation 


The  first  experiment  investigated  whether  subjects  could  acquire  a  schema  from  a  single 
example  by  asking  them  to  "write,  in  abstract  terms,  a  description  of  the  general  technique  illus¬ 
trated  in  the  narrative."  Only  the  instance  group  was  used  in  this  experiment,  since  this  task 
would  simply  be  a  memory  test  for  the  abstract  group.  In  general,  subjects  in  the  instance  group 
produced  good  descriptions  of  the  schema.  The  following  is  the  description  of  the  Kveah  schema 
written  by  one  subject: 


Suppose  in  a  group  of  people,  each  person  w-ould  like  to  buy  something  expensive,  but  over  a 
period  of  time,  each  person  cannot  earn  enough  to  buy  what  he  would  like.  By  using  random 
selection,  each  person  could  be  assigned  a  number,  when  the  group  had  saved  enough  money 


together  to  purchase  an  item,  the  person  with  the  first  number  would  get  his  item.  This  would 


continue  for  the  rest  of  the  group  until  everyone  had  gotten  what  he  wished. 


Table  14.1:  Variables 
Variables 


;  identity  of  participants 
number  of  participants  (n) 

:  exact  time  of  meetings 
|  interval  between  meetings  (t) 
;  amount  of  donation  (m) 

;  items  bought 
stores  where  items  bought 
;  method  cf  determining  order 


and  Constraints  for  the  Kyeah  Schema _ 

_ _ Constraints _ 

.  participants  want  items  of  similar  value 
participants  cannot  afford  items 
participants  trust  each  other 
i  participants  can  afford  m  each  t 
each  participant  donates  same  amount 
cost  of  desired  items  —  n  X  m 
i  number  of  meetings  -  number  of  participants 
order  must  be  assigned  randomiv 
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A  more  objective  measure  of  their  performance  is  that,  overall,  subjects  given  a  specific  instance 
explicitly  mentioned  7 5%  ot  the  constraints  and  identified  89%  of  the  variables  m  their  descriptions 
of  the  general  schema.  Details  on  the  scoring  methods  used  are  given  in  [AhnS7bJ. 

14.3.  Experiment  2:  New  Instance  Generation 

The  second  experiment  was  constructed  to  determine  whether  subjects  in  both  the  instance 
and  abstract  group  could  produce  another  specific  instance  of  the  learned  schema.  Subjects  in  the 
instance  group  were  told  tc  "write  another  story  in  which  characters  use  the  general  method  illus¬ 
trated  in  the  story  but  that  is  otherwise  as  different  as  possible"  while  subjects  in  the  abstract 
group  were  told  to  "write  a  story  in  which  particular  individuals  use  the  technique  described  in  the 
passage  in  a  specific  case."  In  general,  both  groups  produced  equally  good  narratives.  The  following 
is  the  new  Kyeah  narrative  written  by  one  subject  in  the  instance  group. 

Bill.  Kim.  John  and  Mary  were  all  business  associates.  Bill  wanted  some  land  in  Northern  Illi¬ 
nois.  Kim  wanted  a  new  house  in  Switzerland.  John  wanted  a  new  Porshe  92SS  with  all  all 
accesories.  and  Mary  wanted  to  take  a  trip  around  the  world.  The  only  problem  was  they  each 
only  had  S25.000.00  left  unspent  at  the  end  of  each  month.  They  all  got  together  and  picked 
random  variables  on  Bill's  business  computer.  Mary  was  farthest  from  her  variable  so  she 
would  have  to  wait  till  last  to  get  her  trip  around  the  world.  John  nailed  his  variable  and 
jumped  enthusiastically  saying.  "Yea.  I  get  to  get  my  new  Porshe  92SS  right  now."  They  each 
talked  with  their  banker  and  drew  the  S25  Thousand  dollars  out  and  pooled  it  together  after 
the  first  month  and  the  next  day  John  drove  up  in  his  new.  black.  928S  with  all  accessories.  At 
the  end  of  the  next  month  they  again  pooled  their  money  and  Kim  got  her  chalet  in  Switzer¬ 
land.  Again  at  the  end  of  the  next  month  they  pooled  their  money  an  Bill  got  his  land  in 
Northern  Illinois.  Finally,  after  the  fourth  month  they  pooled  their  money  together  and  Mary 
left  for  her  trip  around  the  world. 

A  more  objective  measure  of  their  relative  performance  is  that  the  stories  written  by  the  instance 
group  explicitly  and  correctly  instantiated  Sl%  of  the  constraints  compared  to  76%  for  the  abstract 
group.  The  difference  between  the  two  groups  is  insignificant,  indicating  that  subjects  in  both 
groups  learned  the  schema  equally  well. 
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14.4.  Experiment  3:  Yes/No  Questions 

The  third  and  final  experiment  directly  tested  whether  subjects  could  correctly  identify 
whether  cr  not  a  particular  component  was  a  variable  or  a  constraint  ir.  the  general  schema  Sub¬ 
jects  in  both  the  instance  and  abstract  group  were  explicitly  asked  yes/no  questions  like:  '‘Can  some 
people  consistently  donate  less  than  others  and  have  the  system  work'1"  and  “Is  there  any  particu¬ 
lar  number  of  people  required  for  this  plan?"  Overall,  the  two  groups  were  able  to  answer  these 
cues; ions  ecuallv  well.  The  instance  group  correctly  answered  S5°"c  of  the  questions  compared  to 
S 1  ~e  for  the  abstract  grout?. 

14.5.  Discussion  of  the  Experimental  Results 

In  general,  the  experiments  reviewed  m  this  chapter  support  the  claim  that  people  can  learn  a 
schema  by  explaining  and  generalizing  a  single  narrative  a  la  GENESIS.  However,  also  like 
GENESIS,  people  can  only  perform  explanation-based  learning  when  they  can  explain  all  of  the 
actions  that  compose  the  schema.  In  [AhnS7b],  additional  experiments  are  presented  that  demon¬ 
strate  subjects  inability  to  learn  schemata  from  narratives  for  which  they  cannot  construct  com¬ 
plete  causal  explanations.  For  example,  subjects  could  not  learn  a  schema  for  an  American  Indian 
potlatch  ceremony  from  one  example  since  they  could  not  determine  motivations  for  all  of  the 
actions  in  the  example. 

There  are  also  some  interesting  observations  that  can  be  made  regarding  subjects'  apparent 
generalization  processes.  For  example,  unlike  SOAR  (see  section  3.8).  they  obviously  do  not  simply 
change  each  constant  in  the  explanation  to  an  independent  variable.  Since  in  the  Kyeah  narrative 
ail  of  the  characters  made  their  purchases  at  the  same  store  (i.e.  Service  Merchandise),  simply 
changing  the  constant  representing  this  store  to  a  variable  would  result  in  a  schema  in  which  all  of 
the  participants  had  to  buy  their  items  at  the  same  store.  However,  in  none  of  the  experiments  did 
any  of  the  subjects  believe  that  this  was  a  constraint  on  the  underlying  schema.  Of  course,  this 
does  not  imply  that  subjects  must  therefore  use  a  generalization  process  analogous  to  the  unification 
algorithms  discussed  in  chapter  3:  however,  it  does  indicate  that  people’s  generalization  processes 
ire  not  as  simple  as  cnanging  constants  to  variables.  Subjects'  generalization  process  takes  greater 
advantage  cf  their  existing  knowledge  of  the  domain.  In  fact,  it  is  clear  that  in  some  wavs  people's 
ability  to  generalize  explanations  is  even  more  powerful  than  algorithms  like  EGGS,  EBG.  and 
STRIPS.  For  example,  m  the  Kyeah  example,  almost  ail  of  the  subjects  realized  that  the  plan 
would  work  with  anv  number  of  people  as  long  as  there  was  an  equivalent  number  of  meetings  for 
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collecting  ar.d  allocating  the  money.  However,  producing  this  generalization  requires  recognizing 
the  repeated  structure  of  the  original  example  and  producing  a  general  iterative  plan  for  the  overall 
schema.  Performing  this  process  of  generalizing  to  n  from  one  example  in  a  machine  learning  sys¬ 
tem  requires  additional  generalization  techniques.  BAGGER  [ShavlikS7a.  ShavlikSS]  is  an  EBL  sys¬ 
tem  which,  in  addition  to  EGGS,  uses  a  process  for  analyzing  recursive  rule  applications  in  order  to 
learn  iterative  plans  from  one  example. 


CHAPTER  15 


CONCLUSIONS  AND  FUTURE  WORK 

This  thesis  has  demonstrated  that  a  general  explanation-based  learning  mechanism  can 
efficiently  learn  and  improve  its  performance  in  a  wide  variety  of  domains.  In  particular,  it  has 
shown  that  a  general  EBL  mechanism  is  capable  of  improving  performance  on  the  complex  task  of 
understanding  narrative  text  by  acquiring  schemata  for  novel  plans  that  achieve  important  goals. 
Finally,  it  has  reviewed  empirical  evidence  that  indicates  that  explanation-based  learning  is  a  plau¬ 
sible  model  of  certain  types  of  human  learning  in  this  domain.  This  final  chapter  summarizes  the 
unique  features  of  the  research  presented  in  this  thesis  and  outlines  some  problems  requiring  future 
research. 

15.1.  Relation  to  Other  Work 

Below-  are  four  aspects  that  make  the  work  presented  in  this  thesis  unique: 

:  1)  It  is  based  on  a  general  domain-independent  explanation-based  learning  mechanism  which  has 
been  tested  on  numerous  examples  from  different  domains. 

It  has  addressed  the  problem  of  generalizing  the  temporal  order  of  actions  in  plan-based 
explanations. 

3;  It  has  illustrated  the  ability  of  EBL  to  improve  the  performance  of  an  understanding  system 
as  well  as  a  problem-solving  system. 

1  •» 1  It  has  examined  the  psychological  plausibility  of  an  explanation-based  learning  system. 

15.1.1.  Generality  and  Domain-Independence 

Earlier  explanation-based  learning  systems  were  constrained  by  the  particular  domains  to 
which  they  were  applied  and  were  never  tested  on  examples  from  a  variety  of  domains.  This 
includes  such  systems  as  STRIPS  [Fikes72],  LEX2  [Mi tchellS3 ].  CUPS  [WinstonS3],  LP  [SiiverS3]. 
MA  [0‘RorkeS4],  ARMS  [Segre85],  LEAP  [MitcheilSfi],  PHYSICS-101  [ShavlikSf],  and  the  original 
GENESIS  [MooneyS5a]. 

More  recent  EBL  systems  such  as  PROLOG-EBG  [Kedar-CabelliS7a].  MRS-EBG  [HirshS 7]. 
PROLEARN  [PrreditisS 7 ].  and  PRODIGY  [MintonS7a.  MintonS  7b]  are  general  domain-independent 
EBL  systems;  however,  these  systems  have  not  been  tested  on  as  wide  a  variety  of  examples. 
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According  io  [Kedar-CabelliS'a],  "PROLOG-EBG  has  been  tested  on  the  'cup'  and  'safe-to-stack' 
examples,  and  the  suicide'  example"  (p.  389).  In  [HirshS7],  the  examples  presented  are  the  Safe- 
To-Stack  and  LEX2  examples;  no  other  examples  are  mentioned.  In  [PrieditisS7j.  the  Safe-To-Slack 
and  LEAP  examples  and  examples  on  list  membership  and  Towers  of  Hanoi  are  the  only  ones  men¬ 
tioned.  In  [MintonS7b].  it  states  that  PRODIGY  has  been  tested  on  machine  shop  scheduling  and  3- 
D  robotics  construction  as  well  as  the  "blocks  world." 


i 

I 


.s 

,N 


.  * 


In  addition,  none  of  these  systems  support  ail  of  the  representational  formalisms  supported  in 
EGGS  (i.e.  Horn  clauses,  rewrite  rules,  and  STRIPS  operators).  PROLOG-EBG.  MRS-EBG.  and 
PROLEAR.Y  only  use  Horn  clauses  and  PRODIGY  supports  STRIPS  operators  and  Horn  clauses. 
Aiso.  except  for  PRODIGY,  none  of  these  systems  have  been  integrated  with  a  performance  system 
operating  in  a  complex  domain.  Finally,  unlike  EGGS,  none  of  these  systems  have  facilities  for 
learning  from  the  observed  problem  solving  behavior  of  other  agents. 

15.1.2.  Generalizing  Temporal  Ordering 

Previous  research  on  EBL  and  macro-operators  has  not  addressed  the  problem  of  learning 
macro-operators  with  partially  ordered  actions.  STRIPS  [Fikes72]  and  most  other  systems  that 
learn  macro-operators  (e.g.  [Iba85.  Korf85.  Minton85])  learn  linear  sequences.  Some  recent  work 
has  addressed  the  issue  of  learning  iterative  macro-operators  [ChengS6.  PriediiisS6.  Shavlik87a]. 
However,  the  packaging  procedure  presented  in  section  6.2.2  is  apparently  the  first  system  that 
’.earns  macrc-operators  with  partially-ordered  actions.  Although  the  procedure  for  generating 
partially  -ordered  macro-operators  is  closely  related  to  certain  processes  in  nonlinear  planning,  there 
are  a  number  of  important  differences  which  were  discussed  in  section  6. 2. 2. 6. 

15. 1-3.  Learning  for  Understanding 

Unlike  GE.YESIS.  almost  all  machine  learning  systems  with  performance  components  use 
learning  to  improve  the  abilities  of  a  probiem-solver  or  planner  rather  than  an  understander.  IPP 
[LebowitzSO,  SchankS2]  is  still  one  of  the  few  machine  learning  systems  that  improved  its  ability 
to  understand.  IPP  processed  news  articles  and  used  a  similarity-based  approach  to  learn  speciali¬ 
zations  of  existing  schemata.  For  example,  the  system  already  had  a  schema  for  kidnapping  and 
after  processing  several  stories  that  described  kidnappings  in  Italy  carried  out  by  the  terrorist 
group  the  Red  Brigades,  it  created  a  specialized  schema  for  kidnappings  in  Italy  in  which  the  Red 
Brigades  was  the  default  kidnapper.  Later,  if  it  encountered  an  article  describing  a  kidnapping  in 
Italy  in  which  the  kidnappers  were  not  mentioned,  it  assumed  the  Red  Brigades  was  the  responsible 
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pu'ty  Besides  using  a  similarit y-based  instead  of  an  explanation-based  approach  to  learning.  1PP 
learr.ec  specializations  o’’  existing  schemata  instead  of  new  schemata  for  compositions  of  actions 
tr.at  achieve  important  goals.  As  mentioned  in  chapter  13.  CYRL’S  [KoiodnerM]  also  used 
similarity-based  methods  to  learn  specializations  of  existing  schemata,  which  it  used  to  index  and 
retrieve  specific  episodes. 

Other  than  GENESIS,  the  only  EBL  system  that  improves  its  ability  to  understand  external 
problem  solving  behavior  is  the  ARMS  system  [SegreSTa],  ARMS  uses  learned  schemata  to  aid  the 
understanding  of  assembly  sequences  in  a  robotics  domain.  However,  as  mentioned  in  section 

10.1.3.  the  understanding  system  in  ARMS  is  incapable  of  inferring  missing  actions  and  therefore, 
unlike  GENESIS,  the  plan  schemata  it  learns  can  not  be  used  to  fill  in  gaps  in  future  observations. 

15.1.4.  Psychological  Plausibility 

This  research  is  also  unique  in  that  it  has  motivated  psychological  experiments  that  demon¬ 
strate  that  people  can  perform  explanation-based  learning  from  a  novel  plan  presented  in  a  narra¬ 
tive  [AhmSTa.  Ahn87b].  There  are  currently  no  other  psychological  experiments  specifically 
directed  at  judging  the  ability  of  an  EBL  system  to  model  human  learning.  Nevertheless,  as 
reviewed  in  [MurphyS5].  there  is  a  substantial  amount  of  psychological  research  that  reveals  the 
important  effect  subjects'  background  knowledge  and  naive  theories  of  the  world  have  on  the  pro¬ 
cess  of  concept  acquisition. 

15.2.  Problems  for  Future  Research 

The  research  presented  in  this  thesis  as  well  as  numerous  related  projects  has  revealed  a 
number  of  interesting  problems  that  need  to  be  addressed.  This  section  briefly  discusses  several 
problem  areas  in  explanation-based  learning  and  mentions  current  research  efforts  that  are  attempt¬ 
ing  to  confront  these  problems. 

15.2.1.  The  Effect  of  Learning  on  Performance 

As  illustrated  with  empirical  data  in  chapter  7.  expiar.atior.-based  learning  can  have  both 
positive  and  negative  effects  on  future  performance.  However,  for  unrestricted  learning  of  macros 
in  problem  solving,  there  is  conflicting  empirical  evidence  regarding  which  of  these  effects  dom¬ 
inates  in  the  long  run  [MintonS5.  0'RorkeS7a.  ShavlikSS],  Consequently,  further  empirical  and 
theoretical  analysis  of  the  effect  of  learning  on  performance  is  required.  Of  course,  learning  and 
forgetting  the  right  macros  is  crucial  to  performance  improvement 
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15.2.2.  Operationality  and  Pruning 


Learning  macros  or  schemata  at  the  appropriate  level  of  generality  is  one  aspect  of  insuring 
that  learning  improves  performance.  However,  there  is  currently  no  general  domain-independent 
characterization  of  what  makes  a  concept  operational.  In  EGGS,  pruning  for  operationally  requires 
a  special  procedure  for  each  domain.  In  PROLOG-EBG,  arbitrary  axioms  are  used  to  define  particu¬ 
lar  predicates  as  operational  and  in  MRS-EBG.  an  arbitrary  set  of  axioms  can  be  used  to  prove  that 
an  expression  ls  operational.  While  these  approaches  allow  for  flexibility  in  determining  opera- 
tionality.  they  do  not  help  to  characterize  the  notion  of  operationality  in  a  general  way.  Some 
recent  research  in  EBL  has  been  directed  at  the  problem  of  determining  operational  descriptions. 
Systems  in  particular  domains  have  been  used  to  explore  learning  schemata  at  different  levels  of 
generality  [SegreS7b.  ShavlikS7bi  and  the  METALEX  system  [K.eller87a]  searches  for  specific  con¬ 
cept  descriptions  that  empirically  improve  performance.  However,  there  is  currently  no  analytical 
characterization  of  operationality.  In  fact,  current  experience  probably  indicates  that  such  a  char¬ 
acterization  is  not  forthcoming  and  that  heuristic  and  empirical  methods  will  continue  to  be  the 
only  ways  of  determining  an  appropriate  level  of  generality  for  what  is  learned. 

15.2.3.  Deciding  What  to  Learn 

A  slightly  different  problem  is  determining  whether  or  not  a  particular  macro  or  schema  is 
worth  iea.-mng  at  all.  Learning  large  numbers  of  useless  rules  only  exacerbates  the  problem  of 
slowdown.  As  mentioned  in  chapter  7.  there  has  been  some  research  on  analytical  and  empirical 
methods  for  determining  utility.  In  particular,  this  has  beem  the  primary  focus  of  Minton's 
research  (M:ntonS5.  MintonS  7a].  However,  the  problem  of  determining  what  must  be  learned  and 
what  must  be  forgotten  in  order  to  improve  overall  system  performance  is  stiii  largely  an  open 
question. 

15.2.4.  Understanding  Observed  Behavior 

Learning  from  observed  problem  solving  behavior  like  that  performed  by  GENESIS  and  vari¬ 
ous  learning  apprentice  systems  [MitchellS5.  0'Rorke57b.  SegreS7a.  WilkinsSd],  is  ar.  efficient 
method  for  acquiring  expert  knowledge.  Since  this  process  requires  the  ability  to  explain  the  inten¬ 
tional  actions  of  external  agents,  the  process  of  understanding  or  plan  recognition.  [Schmidt78], 
becomes  an  important  part  of  the  learning  process.  However,  understanding  is  a  difficult  process 
which  :s  far  from  completely  understood  despite  the  fact  that  it  has  been  fairly  extensively  studied 
m  natural  language  processing.  The  understanding  component  in  GENESIS  and  other  current 


of  situations  that  humans  can 


systems  :s  clearly  ircapabie  of  explaining  the  wide  variety 
comprehend.  0!  course,  research  in  this  area  is  continuing  and  there  has  recently  been  seme  new 
approaches  to  the  problem.  Recent  work  by  Charniak  [CharmakS6]  and  Kautz  and  Allen  [KautzSb] 
■a.  ere  briefly  discussed  in  section  10.1.3.  Recent  understanding  research  by  Schank  and  his  col¬ 
leagues  [KassS6.  LeakeS6.  SchankS6a]  has  involved  the  SWALE  system,  which  constructs  explana¬ 
tions  for  situations  by  modifying  or  tweaking  existing  schemata  or  explanation  patterns.  This 
approach  is  an  attempt  to  avoid  the  combinatorially  explosive  nature  of  plan-based  understanding 
x  h :ie  maintaining  the  ability  to  explain  novel  situations. 

15.2.5.  Intractable  Domain  Theories 

In  EBL.  the  combinatorially  explosive  nature  of  explaining  why  a  plan  works  or  why  an 
example  is  a  member  of  a  concept  has  been  referred  to  as  the  intractable  theory  problem 
[MitchellSb].  I  n  any  realistic  domain,  constructing  a  complete  explanation  for  why  a  plan  actually 
works  is  an  expensive  process.  For  example,  the  kidnapping  schema  GEXESIS  learns  is  actually 
O’-eriy-general  because  it  does  not  consider  the  possibility  that  the  police  or  other  counter-agents 
might  interfere  with  the  plan.  The  act  of  communication  between  the  kidnapper  and  the  ransom 
payer  should  actually  be  constrained  so  that  it  does  not  reveal  the  kidnapper's  identity  and  thereby 
result  m  his  arrest.  GEXESIS  is  implicitly  making  the  assumption  that  external  agents  will  not 
interfere;  however,  it  has  no  way  of  recovering  if  this  assumption  is  violated.  Attempting  to  con¬ 
strain  the  plan  to  prevent  any  possible  interference  on  the  part  of  any  other  agent  is  clearly  not  a 
tractacie  solution.  Systems  that  attempt  to  deal  with  this  problem  are  described  in  [ChienS7a. 
ChienSTbj.  The  basic  approach  taken  in  Chien’s  work  is  to  make  simplifying  assumptions  during 
the  explanation  process  in  order  to  make  it  tractable.  When  a  subsequent  failure  occurs,  it  is 
explained  in  terms  of  a  violation  of  an  initial  assumption  and  the  learned  concept  is  refined  to 
account  for  the  failure.  Alternative  approaches  to  dealing  with  the  intractable  domain  theory  prob¬ 
lem  are  described  in  [Ber.nettS 7.  Doy:e$6.  TadepaII156]. 

15.2.6.  Incomplete  Domain  Theories 

Corresponding  to  the  intractable  theory  problem  is  the  incomplete  theory  problem  [MitcheilSb] 
:r.  which  the  current  domain  theory  is  incapable  of  completely  explaining  an  example.  One 
approach  to  refining  an  initial  domain  theory  based  on  experimentation  is  described  in  [Rajamc- 
r.e y  S 5 .  Raiamcr.eyST],  When  a  prediction  supported  by  the  current  theory  is  contradicted  by 
empirical  evidence,  the  system  examines  the  domain  axioms  underlying  the  faulty  prediction  and 
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conducts  experiments  to  determine  exactly  which  of  these  axioms  is  incorrect.  Another  approach  to 
dealing  with  incompleteness  is  to  use  similarity-based  methods  to  induce  initial  causal  theories 
L AndersenSTa.  PazzaniS7j. 

15.2.7.  Hybrid  Learning  Methods 

Approaches  to  dealing  with  intractable  and  incomplete  theories  frequently  involve  integrating 
explanation-based  and  similarity-based  learning.1  For  example,  integrated  approaches  can  be  used 
to  deal  with  intractabiiitv  by  using  the  detection  of  similarities  to  focus  the  explanation  process. 
This  approach  is  taken  in  [LebowitzS6],  in  which  similarities  in  the  congressional  voting  record  are 
used  tc  focus  the  process  of  explaining  why  certain  members  of  congress  voted  for  or  against  a  par¬ 
ticular  bill.  If  the  incompleteness  arises  from  the  fact  that  operators  are  represented  procedurallv 
instead  of  declarativelv.  empirical  techniques  can  be  used  in  the  actual  process  of  generalizing 
explanations  (PorterS5.  PorterS6]. 

Correspondingly.  EBL  methods  can  be  used  to  address  problems  with  purely  empirical 
approaches.  For  example,  explanations  of  particular  examples  can  be  used  to  select  relevant 
features  for  similarity-based  methods  [Danyh ikS7.  FhnnSS,  .Mitchell84],  Also,  goal-related 
knowledge  can  be  used  to  guide  feature  selection  for  conceptual  clustering  systems  [MogensenS7. 
Stepp86].  In  general,  this  approach  allows  explanations  to  contribute  to  a  similarity-based  system's 
inductive  bias  [Uigoff56j. 

Current  approaches  to  combining  the  two  learning  methods  effectively  use  one  of  the  methods 
to  focus  or  direct  the  other.  An  interesting  area  for  future  research  involves  using  each  method  to 
learn  different  parts  or  features  of  a  single  concept.  For  example,  imagine  a  system  like  GENESIS 
trying  to  learn  a  schema  for  a  birthday  party  by  reading  narratives  about  particular  celebrations. 
Such  a  system  could  probably  use  its  domain  knowledge  to  explain  the  baking,  cutting,  and  eating 
of  the  birthday  cake  since  these  actions  are  causally  connected  and  satisfy  important  hunger  and 
enjoyment  goals.'  However,  it  would  probably  not  be  able  to  explain  why  someone  put  a  particular 
number  of  candles  on  the  cake  and  why  someone  else  made  a  wish  and  then  blew  them  out  while 
e.eryone  eise  sang.  Howe'er,  these  components  of  the  birthdav-party  schema  might  be  learned 
-smg  a  similarity-based  approach.  Since  many  stereotypical  actions  (e.g.  a  wedding  ceremony  or  a 
-  -  to  the  restaurant  or  supermarket)  contain  both  features  that  are  causally  necessary  and  others 

:  —  t  -esea  rc.-.ers  nay  surges',  mat  ECCS  would  combine  nicely  with  BACON  [LangleyS  1  i;  however,  .'our  out  of  five 
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that  are  conventional  such  ar.  apprcacn  ic  integrating  the  two  methods  could  be  very  useful,  par¬ 


ticularly  in  the  domain  of  narrative  understanding. 


15.3.  Conclusions 


This  thesis  has  shown  that  a  single  general  learning  mechanism  can  perform  explanation-based 
learning  in  a  wide  variety  of  domains.  The  generalization  mechanism  in  the  EGGS  system  has  been 
shown  to  be  able  to  generalize  explanations  based  on  logical  proofs,  term  rewritings,  and  plans  com¬ 
posed  of  STRIPS  operators.  This  generalization  algorithm  was  compared  to  the  generalization  algo¬ 
rithms  in  STRIPS  and  PROLOG-EBG.  Like  EGGS,  both  of  these  algorithms  use  unification  pattern 
matching  to  compute  the  most  general  explanation  that  maintains  the  structure  of  an  explanation 
for  a  specific  example.  Theoretically,  the  time  required  to  compute  a  generalized  explanation  was 
snow-n  to  be  linear  in  the  size  of  the  explanation.  Compared  to  generalization  algorithms  based  on 


unification,  the  generalization  algorithm  in  SOAR  was  shown  to  be  susceptible  to  under¬ 


generalization. 


With  regard  to  logical  proof  explanations,  mechanisms  in  EGGS  for  proof  construction,  proof 
completion,  and  macro-rule  learning  were  presented.  Examples  of  learning  using  logical  proof 
explanations  were  given  for  the  following  domains:  logic  circuit  design  (as  in  the  LEAP  sy'stem). 
logic  theorem  proving  (as  in  MA).  integration  problem  solving  (as  in  LEX2),  geometry  theorem 
proving  (as  in  ACT*  and  PUPS),  and  blocks  world  planning  (as  in  STRIPS). 


Regarding  explanations  composed  of  rewrite  rules,  mechanisms  in  EGGS  for  rewriting  expres¬ 
sions  and  learning  rewrite  macro-rules  were  discussed.  Examples  of  learning  using  term  rewritings 
were  presented  for  the  following  domains:  logic  circuit  design,  integration  problem  solving,  and 


eouation  solving. 


Finally,  regarding  plan-based  explanations  composed  of  STRIPS  operators,  mechanisms  in 
EGGS  for  verifying  plans  and  learning  macro-operators  with  partially  ordered  actions  w'ere 
presented.  The  process  of  learning  partially  ordered  macro-operators  from  specific  plans  was  com¬ 
pared  to  the  task  of  nonlinear  planning.  Examples  of  learning  using  plan-based  explanations  were 
given  for  blocks  world  planning  and  computer  programming. 


Empirical  results  on  the  effect  of  macro-rules  on  problem  solving  performance  were  also 
presented.  These  results  show  that  learning  macro-rules  can  greatly  improve  performance  on  simi¬ 
lar  problems;  however,  in  certain  situations,  it  can  also  substantially  degrade  performance. 
Approaches  to  dealing  with  the  problem  of  degraded  performance  were  reviewed  and  suggestions 
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:cr  conducting  properly  controlled  experimental  comparisons  of  learning  and  non-learning  systems 
'-ere  given. 

The  EGGS  learning  system  has  been  tested  most  thoroughly  as  a  component  of  the  GENESIS 
narrative  understanding  system.  The  overall  architecture  of  GENESIS  was  reviewed  and  the 
importance  of  learning  to  improve  the  performance  of  a  complete  Al  system  was  discussed.  The 
knowledge  representation  used  in  GENESIS  and  its  ability  to  construct  explanations  for  narratives 
vere  discussed  in  detail  using  the  learning  of  a  murder-for-inheritance  schema  as  an  example.  The 
schema  and  plan-cased  understanding  processes  in  GENESIS  were  presented  and  compared  to  those 
in  previous  narrative  understanding  systems. 

It  was  shewn  how  schema  acquisition  in  GENESIS  uses  the  EGGS  system  to  generalize  the 
explanations  constructed  for  characters’  actions  in  a  narrative.  An  additional  generalization  step 
that  insures  that  the  learned  schema  represents  a  volitional  action  and  does  not  contain  any  unmo¬ 
tivated  actions  was  also  discussed.  The  description  of  schema  acquisition  in  GENESIS  also  included 
discussions  of  processes  for  deciding  when  to  learn,  packaging  a  generalized  explanation  into  a 
schema,  and  indexing  a  schema  so  that  it  can  be  used  to  aid  subsequent  understanding. 

In  addition.  GENESIS'  ability  to  iearn  provisional  meanings  for  unknown  schema-reiated 
words  was  reviewed.  Unlike  other  approaches  to  the  acquisition  of  word  meaning.  GENESIS  is  • 
capable  of  simultaneously  learning  a  concept  and  meanings  for  related  words  from  a  single  exam¬ 
ple.  GENESIS’s  ability  to  use  learned  schemata  to  index  and  retrieve  specific  episodes  was  also 
presented. 

Finally,  psychological  experiments  were  reviewed  that  revealed  that  human  subjects,  like 
GENESIS,  could  learn  a  novel  plan  schema  from  a  single  example.  These  experiments  demonstrated 
that  after  reading  a  specific  instance  of  a  novel  plan,  subjects  could  write  a  good  description  of  the 
underlying  schema.  They  could  also  produce  another  instance  of  the  schema  and  answer  questions 
abcut  the  schema  as  well  as  subjects  who  read  a  passage  directly  describing  the  abstract  plan. 

Together  with  other  recent  projects  investigating  general  EBL  mechanisms,  the  work  presented 
in  this  thesis  is  an  indication  that  research  in  EBL  has  reached  a  certain  level  of  "maturity"  and  has 


moved  beyond  the  stage  of  exploratory  programs  that  operate  in  a  single  domain.  Nevertheless,  as 
outlined  in  the  previous  section,  there  continue  to  be  even  more  interesting  issut.  r  the  horizon 
which  are  already  attracting  a  great  deal  of  attention. 
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LINEAR  SUBSTITUTION  APPLICATION 


Although  a  linear-time  unification  algorithm  is  described  in  [Paterson78],  a  linear-time  algo¬ 
rithm  for  applying  a  substitution  to  a  wff  is  required  in  order  insure  that  a  generalized  explanation 
can  be  computed  in  linear  time  (see  section  3.4).  To  insure  linear  performance,  a  substitution 
application  algorithm  must  be  careful  to  avoid  tracing  down  the  final  value  of  a  variable  multiple 
times.  Such  duplication  of  work  can  be  prevented  by  using  path  compression  like  that  performed  in 
the  standard  UNION-FIND  algorithm  [Remgold77j.  One  approach  is  to  first  compress  or  "flatten"  a 
substitution  so  that  every  variable  in  the  substitution  is  directly  bound  to  its  final  value.  Since  a 
flattened  substitution  can  easily  be  applied  to  a  wff  in  linear  time,  if  the  flattening  can  be  per¬ 
formed  in  linear  time,  then  the  complete  process  wili  be  linear. 

As  in  [Paterson7S],  it  is  assumed  that  wffs  are  represented  as  directed  acydical  graphs  in 
which  common  subexpressions  are  represented  by  a  single  subgraph.  A  node  representing  a  Ir-arv 
function  or  predicate  has  outdegree  k  and  sonln,  i)  ( 1  <  t  <  k)  refers  to  the  t'th  son  (argument)  of 
node  n.  A  substitution  will  consist  of  a  list  of  nodes  representing  variables  that  have  pointers  to 
nodes  representing’their  values  (i.e.  Value(v)  refers  to  the  value  of  variable  node  v  and  is  NIL  if  v  is 
not  bound).  It  is  assumed  that  this  substitution  is  generated  by  a  unifier  and  therefore  does  not 
have  any  cccur-check  violations.  Given  this  representation,  an  algorithm  for  applying  a  substitu¬ 
tion  to  a  wff  is  given  in  Figure  A.l. 

The  procedure  FlattenSubstitution  changes  the  substitution  into  one  in  which  variables  point 
directly  to  their  final  values.  This  is  done  by  first  finding  the  final  value  for  each  variable  by  fol¬ 
lowing  value  pointers  and  performing  path  compression  [Reingold77]  so  that  all  variables  encoun¬ 
tered  along  the  way  are  also  made  to  point  to  their  final  values.  Finding  the  final  value  of  any 
variable  will  be  called  a  FIND  since  it  is  analogous  to  the  corresponding  process  in  the  UNION- 
FIND  problem.  Performing  a  FIND  on  all  the  variables  in  the  substitution  is  linear  in  the  size  of 
the  substitution  for  the  following  reasons.  Let  n  be  the  number  of  variables  in  the  substitution 
(the  totai  number  of  value  pointers)  and  let  p  be  the  total  number  of  value  pointers  followed  by 
doing  a  FIND  on  all  variables.  Every  FIND  requires  following  exactly  one  last  pointer  to  a  value 
that  is  either  an  unbound  variable  or  not  a  variable  at  all  (let  such  pointers  be  called  fined  pointers). 
Consequently,  following  final  pointers  accounts  for  n  of  the  total  number  of  pointer  references. 
Let  o  be  the  total  number  of  pointers  traversed  that  are  not  final  pointers.  Therefore,  p  ■=  n  -  o. 
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let  S  be  the  substitution  and  w  be  the  wff  to  which  it  is  to  be  applied 
FlattenSubstitution(S) 

ReplaceVanables(  w) 

procedure  FlattenSubstitution(S) 
for  variable  v  in  S  do 
let  A  be  an  empty  stack 

■while  v  is  a  variable  node  and  Vaiue(v)5A\[L  do 
push(v.  A) 
let  v  =  Value!  v) 
while  A  is  nonempty  do 
Value(pop(A))  =  v 
for  variable  v  in  S  do 

if  Value(v)  is  not  marked  as  already  visited  then 
mark  Value! v)  as  visited 
ReplaceVariables(Value(v)) 

procedure  ReplaceVariabies(n) 

if  n  is  a  node  for  a  variable  or  a  constant 

then  return 
else 

for  i  from  1  to  outdegree(n)  do 

if  son(n.i)  is  a  variable  node  and  Value(son(n.j))?*XJL 
then  let  son(n.i)  =  Value(son(n.i)) 
else  ReplaceVariables(son(n.i)) 

Figure  A.l:  Linear  Substitution  Application  Algorithm 


Each  of  the  o  non-final  pointers  followed  in  a  FIX'D  results  in  that  pointer  being  deleted  and  being 
replaced  by  a  final  pointer.  Since  the  initial  number  of  non-final  pointers  is  clearly  less  than  n  (the 
total  number  of  pointers),  there  are  at  most  n  non-final  pointers  to  be  deleted  and  hence  o  ^  n. 
Consequently,  p  ^  2n  and  therefore  finding  the  final  values  for  all  the  variables  takes  only  linear 
time. 

The  procedure  ReplaceVariabies  simply  traverses  a  wff  and  replaces  any  bound  variables 
encountered  with  their  previously  computed  final  value.  Since  traversing  a  graph  is  linear  in  the 
size  of  the  graph,  this  process  is  clearly  linear  in  the  size  of  the  input  wff.  After  the  final  values  for 
ail  variables  has  been  computed.  ReplaceVariabies  is  called  on  all  the  final  values  in  order  to  replace 
any  variables  within  them  with  their  final  values.  Values  are  marked  when  they  are  traversed  to 
avoid  traversing  them  more  then  once.  Since  ReplaceVariabies  is  linear,  the  process  of  replacing 


variables  in  values  is  also  linear  with  respect  to  the  size  cf  the  overall  substitution.  I  his  nr.ishes 
the  process  of  "flattening"  and  all  variables  are  now  bound  to  their  completely  instantiated  final 
values.  Calling  ReplaceYariables  on  the  input  wff  will,  in  linear  time,  replace  all  variables  with 
their  final  instantiated  values.  Since  "flattening"  is  linear  in  the  size  of  the  substitution  and  since 
replacing  variables  in  the  input  wff  is  linear  in  the  size  of  the  wff.  the  algorithm  in  Figure  A. 2  is 
linear  in  the  size  of  its  innuts. 
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ADDITIONAL  GENESIS  EXAMPLES 

In  chapters  9-11  a  detailed  description  was  given  of  how  GENESIS  learns,  indexes,  and  subse¬ 
quently  uses  a  murder-for-inheritance  schema.  In  addition,  in  chapter  2.  a  trace  of  GENESIS’  I/O 
behavior  was  given  for  the  learning  of  a  kidnap-for-ransom  schema.  This  appendix  provides 
further  details  on  the  kidnapping  example  as  well  as  detailed  descriptions  of  two  additional 
GENESIS  examples.  The  two  additional  examples  include  learning  a  schema  for  a  person  burning 
their  own  property  in  order  to  collect  the  insurance  money  and  learning  a  schema  for  a  police- 
officer  impersonating  a  prostitute  in  order  to  arrest  potential  customers.  Definitions  for  all  of  the 
schemata  and  rules  needed  to  process  these  examples  are  given  in  appendix  C. 

B.l.  The  Kidnap  Example 

This  section  presents  additional  information  on  the  kidnapping  example.  The  test  and  learning 
narratives  as  well  as  a  system  trace  for  this  example  were  presented  in  chapter  2. 

B.'.I.  Processing  the  Test  Narrative  Before  Learning 

The  parsed  version  of  the  test  narrative  for  the  kidnap  example  is  shown  in  Figure  B.l.  When 
processing  this  narrative  for  the  first  time  given  only  the  knowledge  in  appendix  C.  GENESIS  con¬ 
structs  the  causal  model  shown  in  Figure  B.2.  Except  for  connecting  Alice's  imprisonment  to  her 
subsequent  release,  relatively  little  causal  structure  is  built.  The  fact  that  Bob  locked  Alice  in  his 
basement  (which  is  parsed  as  a  Confine  action)  suggests  a  Capture  schema  and  this  schema  is 


(Isa  Personl  Person)  (Gender  Personl  male)  (Name  Personl  Ted)  (Isa  Person2  Person) 
(Name  Person2  Alice)  (Gender  Person2  female)  (Husband  Personl  Persor.2) 

(Isa  Monevl  Money)  (Amount  Moneyl  1 00000  dollar)  (Isa  Lottervl  Lottery) 

(Win  Personl  Moneyl  Lottery  1)  (Isa  Person3  Person)  (Gender  Persor.3  male) 

(Name  Person3  Bob)  (Isa  Basemenil  Basement)  (Isa  Locationl  Location) 

(In  Locationl  Basemenil)  (Confine  Person3  Person2  Locationl  Basemenil) 

(Residence  Person3  Basemenil)  (Isa  Money2  Money)  (Amount  Money2  75000  dollar) 
f. Possess  Person3  Money2)  (Release  Person3  Person2  7FRO.M6  7IN12) 


Figure  B.l:  Parsed  Version  of  the  Kidnap  Test  Narrative 
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Gender  1  Gender2  Namel  Name3  Amo:int2  Residencel 

Isa28  Gender3  Name2  Amountl  Possess4 


Free-i 

Alice  is  free. 

Winl 

Ted  won  the  $100000  in  the  lottery. 

Reieasel 

Bob  released  Alice. 

Isa8 

The  lottery  is  a  game. 

Capnvel 

Alice  was  Bob's  captive. 

Isa4 

The  lottery  is  a  lottery. 

Capture! 

Bob  captured  Alice. 

IsalO 

The  S100000  is  an  inanimate  object. 

Ini 

In  the  basement  is  in  the  basement. 

Isai; 

The  S100000  is  a  valuable. 

Isa:: 

The  basement  is  a  room. 

Isa3 

The  S100000  is  money. 

Isa  18 

The  basement  is  a  basement. 

Isa  14 

Ted  is  a  character. 

Isa  19 

In  the  basement  is  a  location. 

Isal 

Ted  is  a  person. 

Isa: 

Alice  is  a  person. 

PositivelPTl 

Ted  has  a  positive  relationship  with  Alice. 

Isa:0 

Bob  is  a  character. 

Spousel 

Ted  is  Alice's  spouse. 

Isal  * 

Bob  is  a  person. 

Husbandl 

Ted  is  Alice's  husband. 

Free: 

Alice  vas  f  ree. 

Isa28 

The  $75000  is  money. 

Noil 

Alice  was  not  free. 

Gender  1 

Ted  is  male. 

Confinel 

Bob  confined  Alice  in  the  basement. 

Gender: 

Alice  is  female. 

At; 

Alice  is  in  the  basement. 

Gender3 

Bob  is  male. 

P'.ransl 

Bob  moved  Alice  into  the  basement. 

Namel 

Ted's  name  is  Ted. 

At  1 

Alice  is  at  some  place. 

Name; 

Alice’s  name  is  Alice. 

Firans: 

Bob  moved  Alice. 

Name3 

Bob's  name  is  Bob. 

PossessU 

Bob  has  the  S75000. 

Believel 

Bob  believes  that  Alice  is  Bob's  captive. 

Not4 

Alice  is  not  Bab’s  captive. 

Amountl 

The  S 100000  is  100000  dollars  of  money. 

Possess: 

Ted  has  the  $100000. 

Amount: 

The  S75000  is  75000  dollars  of  money. 

Residencel 

Bob  lives  in  the  basement. 

Figure  B.2:  Causal  Model  for  Kidnap  Test  Example  Before  Learning 


immediately  determined  since  the  Confine  step  is  causally  supported  by  all  of  the  other  steps  in 
Capture.  Bob’s  release  of  Alice  is  connected  to  his  previous  Capture  since  the  Release's  precondition 
of  having  Alice  held  captive  is  achieved  by  the  Confine  action  in  the  Capture.  As  shown  in  Figure 
B.2.  the  fact  that  Bob  has  $75,000  is  not  connected  to  the  rest  of  the  actions  in  the  narrative. 


Due  to  this  impoverished  representation  of  the  narrative,  the  system  is  unable  to  answer  the 
questions  presented  in  chapter  2.  This  example  originally  revealed  an  interesting  problem  with  an 
early  version  of  GEXESIS’  question-answering  system.  When  asked:  "Why  did  Bob  lock  Alice  in 
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his  basement."  it  produced  the  reply:  "So  Bob  could  release  Alice."  It  judged  this  to  be  a  reason¬ 
able  answer  because  one  of  its  heuristics  for  answererir.g  a  why  questions  aoout  an  action  was  to 
find  a  later  action  that  it  enabled.  This  heuristic  was  later  refined  to  eliminate  enabled  actions  that 
simply  re-achieve  states  that  were  already  true  before  the  execution  of  the  action  in  question.  This 
eliminates  the  answer:  “So  Bob  could  release  Alice"  since  this  release  only  achieves  the  state  of 
having  Alice  free,  which  was  already  true  before  he  locked  her  in  the  basement. 

B.1.2.  Processing  the  Learning  N  rrative 

The  parsed  version  of  the  learning  narrative  for  the  kidnap  example  is  shown  in  Figure  B.3 
While  processing  this  narrative.  GENESIS  builds  a  very  large  and  interconnected  causal  structure. 
Pieces  of  this  causal  structure  are  shown  in  Figures  B.4  -  B.S.  Figure  B.4  shows  the  explanation  for 
why  Mary  got  in  the  car.  This  explanation  can  be  summarized  as  follow's.  John's  action  of 


(Isa  Person?  Person)  (Gender  Person?  male)  (Name  Person7  Fred) 

(Isa  Persons  Person)  (Name  PersonS  Mary)  (Gender  PersonS  female) 

(Isa  Monev3  Money)  (Amount  Monev3  order-millions  dollar) 

(Father  Person7  PersonS)  (Possess  Person7  Money3)  (Isa  Person9  Person) 

(Gender  Person9  male)  (Name  Person9  John)  (Isa  Location2  Location) 

(At  PersonS  Location2)  (Ptrans  Person9  Person9  7FROM11  Location2) 

(Isa  Gunl  Gun)  (Aim  Person9  Gunl  PersonS)  (Color  Jeansl  blue)  (Isa  Jeansl  Jeans) 
(Attire  PersonS  Jeansl)  (Isa  Carl  Car)  (Isa  Location3  Location)  (In  Location3  Carl) 
(Mtrans  Person9  (Implies  (Not  (Ptrans  Person8  PersonS  7FROM61  Location3)) 

(Shoot  Person9  PersonS  7INSTRUMENT9))  Person8) 

(Possess  Person9  Carl)  (Isa  Hoteil  Hotel)  (Isa  Location4  Location) 

(At  Hoteil  Location4)  (Drive  Person9  PersonS  7FROM116  Location4  "’YEHICLE10) 
(Residence  Person9  Hoteil)  (Isa  Rooml  Room)  (Isa  Location5  Location) 

(In  Location5  Rooml)  (Confine  Person9  PersonS  Locatior.5  Rooml) 

(Residence  Person9  Rooml)  (DialTelephone  Person9  Person7  7NLMBER36) 

(Mtrans  Person9  (Captive  PersonS  Person9  7LOC15  7IN18)  Person7)  (Isa  Monev4  Money) 
(Amount  Monev4  250000  dollar)  (Isa  Restaurantl  Restaurant) 

(Name  Restaurantl  Trenos)  (Isa  Locations  Location)  (At  Restaurantl  Location6) 

(Mtrans  Person9  (Implies  (Atrans  Person7  Money4  Person9  Locations) 

(Release  Person9  PersonS  7FROM500  7IN92))  Person7) 

(Atrans  Person?  '’PhvsicalObject  1  Person9  7AT24) 

(Release  ?PhysicalObject2  PersonS  7FROM633  7IN169)  (Isa  Person22  Person) 

(Gender  Person22  female)  (Name  Person22  Valerie)  (Husband  Person7  Person22) 

(Mtrans  Person?  (?Actionl  7Personl  Person8)  Person22) 

Figure  B.3:  Parsed  Version  of  the  Kidnap  Learning  Narrative 
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approacnmg  Mary  puis  ihem  boih  in  ihe  sarr.e  location  and  enables  him  to  aim  his  gun  at  her  John 
telling  Mary  that  he  would  shoot  her  if  she  didn  T  gel  in  the  car  suggests  a  Threaten  schema.  Since 
John  driving  Mary  to  the  hotel  requires  as  a  precondition  that  Mary  be  in  the  car  and  since  this 
state  is  the  effect  of  the  successful  completion  of  the  Threaten,  the  Threaten  schema  is  determined. 
This  in  turn  adds  the  assertion  that  Mary  got  in  the  car  because  she  didn't  want  to  be  shot  and 
causes  the  effect  of  the  Aim  to  be  equated  to  a  precondition  of  the  Threaten. 

The  remaining  explanation  for  how  Mary  became  John's  captive  (an  expansion  of  the  Capture 
schema)  is  shown  in  Figure  B.5.  which  can  be  summarized  as  follows.  John  driving  Mary  to  the 
hotel  brings  them  all  to  the  hotel  s  location.  John  confining  Mary  in  the  hotel  room  suggests  a  com¬ 
plete  Capture  schema,  which  is  immediately  determined  since  the  Confine  action  is  supported  by  all 
the  other  actions  in  Capture.  This  determination  adds  the  assertion  that  John  must  have  moved 
Marv  into  the  room,  which  in  turn  is  connected  to  the  fact  that  she  was  already  at  the  hotel. 
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The  complete  highest-level  explanation  for  how  John  got  the  S100.000  is  shown  in  Figure  B.6 
and  car.  be  summarized  as  follows.  John  calling  Fred  suggests  a  Telephone  schema,  which  is  subse¬ 
quently  determined  when  John  tells  Fred  he  is  holding  Mary  captive.  This  Telephone  instance  is 
possible  because  John  believes  that  he  is  holding  Mary  captive  as  an  effect  of  the  Capture.  It  should 
be  noted  that  GENESIS  has  a  very  naive  view  of  communication  in  which  believing  a  proposition  is 
a  precondition  for  communicating  it  and  m  which  everyone  believes  everything  that  they  are  told. 
Next.  John  telling  Fred  that  he  would  release  Mary  if  Fred  gave  him  S100.000  suggests  a  Bargain 
schema,  which  is  determined  when  both  of  these  actions  take  place.  Preconditions  of  the  completed 
Bargain  include  that  John  have  Mary  held  captive  (which  matches  an  effect  of  the  Capture),  that 
Fred  have  the  $100.00  (which  is  inferred  from  the  fact  that  he  is  a  millionaire),  that  Fred  believe 
that  John  has  Mary  (which  matches  an  effect  of  the  Telephone),  and  that  Fred  value  Mary's  free¬ 
dom  more  than  $100,000  (which  is  inferred  from  the  fact  that  he  is  her  father).  Figure  B.7  shows 
the  expansion  of  the  Bargain  schema  and  Figure  B.S  shows  all  the  facts  in  the  narrative  that  remain 
unconnected  to  the  rest  of  the  text.  The  causal  model  constructed  for  this  narrative  and  illustrated 
in  Figures  B.4  -  B.S  allows  GENESIS  to  answer  the  wide  variety  of  questions  presented  in  the  trace 
in  chapter  2. 

At  this  point  it  is  appropriate  to  note  that  the  Bargain  and  Threaten  schemata  are  more  com¬ 
plicated  to  define  than  the  other  schemata  in  GENESIS.  This  is  because  the  effects  of  these  schemata 
depend  on  the  actions  that  are  involved.  Consequently,  a  special  construct  (‘Pointer*)  is  used  m 
the  definitions  of  these  schemata  in  order  to  indirectly  reference  the  effects  and  preconditions  of 
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Mary  x-as  in  the  car. 

John  threatened  to  shoot  Mary  x-ith  the  gun  unless  Mary  x-ent  from  John  into  the  car. 
Mary  is  a  character. 

Mary  is  a  person. 

John  is  a  character. 

John  is  a  person. 

Mary  x-ants  not  to  be  shot. 

Mary  x-as  at  John. 

John  is  pointing  the  gun  at  Mary. 

John  aimed  the  gun  at  Mary. 

Mary  is  a  physical  object. 

Mary  is  an  animate  object. 

The  gun  is  a  physical  object. 

The  gun  is  an  inanimate  object. 

The  gun  is  a  x-eapon. 

The  gun  is  a  gun. 

John  has  the  gun. 

John  has  a  line  of  sight  path  to  Mary. 

John  is  a:  Mary. 

John  x-ent  to  Mary. 

Mary  is  a  location. 

John  is  a  location. 

John  is  a  physical  object. 

John  is  an  animate  object. 

John  x-as  at  some  place. 

John  is  not  at  some  place. 

Mary  is  not  at  John. 

Mary  ■'rent  from  John  into  the  car. 

Mary  believes  that  if  Mary  does  not  go  from  John  into  the  car  then  John  x-ill 
shoot  Mary  x-ith  the  gun. 

John  told  Mary  that  if  Mary  did  not  go  from  John  into  the  car  then  John  x-ould 
shoot  Mary  x-uh  the  gun. 

Figure  B.4:  Kidnap  Example  —  Specific  Explanation  (Threaten) 
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Captivel 

Mary  u-as  John's  captive. 

In3 

In  the  room  is  in  the  room. 

Isa48 

The  room  is  a  room. 

Isa49 

In  the  room  is  a  location. 

Isa2 

Mary  is  a  person. 

Isa  15 

John  is  a  character. 

Isa4 

John  is  a  person. 

Free2 

Mary  u-as  free. 

Not  13 

Mary  u-as  not  free. 

Confinel 

John  confined  Mary  in  the  room. 

At  26 

Mary  is  in  the  room. 

Ptrans5 

John  moved  Mary  from  the  hotel  into  the  room. 

At23 

Mary  is  at  the  hotel. 

Ptrans3 

John  moved  Mary  from  the  car  to  the  hotel. 

Drivel 

John  drove  Mary  to  the  hotel  in  the  car. 

Isa41 

The  car  is  a  vehicle. 

Isa34 

The  car  is  a  car. 

At  18 

The  car  u-as  at  Mary. 

Ini 

In  the  car  is  in  the  car. 

At9 

Mary  u-as  in  the  car. 

At  20 

John  u-as  in  the  car. 

Possess4 

John  has  the  car. 

Atl6 

John  is  at  the  hotel. 

At  17 

The  car  is  at  the  hotel. 

Not7 

John  is  not  in  the  car. 

Not8 

Mary  u-as  not  in  the  car. 

Not9 

The  car  is  not  at  Mary. 

Isa39 

The  hotel  is  a  location. 

Isa47 

The  car  is  a  location. 

Isa2 1 

Mary  is  a  physical  object. 

Isa23 

Mary  is  an  animate  object. 

Isa25 

Mary  is  a  character. 

Isal  3 

John  is  an  animate  object. 

At24 

Mary  u-as  at  the  car. 

Not  12 

Mary  is  not  at  the  car. 

Believe2 

John  believed  that  Mary  u-as  John's  captive. 

Figure  B-5:  Kidnap  Example  —  Specific  Explanation  (Capture! 
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Fred  is  a  character. 

Isal 

Fred  is  a  person. 

Isa  15 

John  is  a  character. 

lsa4 

John  is  a  person. 
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Isa2 

Mary  is  a  person. 

Isa3 

Millions  of  dollars  is  money 

Isa66 

The  S25000C  is  a  valuable. 

Isa58 

The  S250C00  is  money. 

Isa  4  8 

The  room  is  a  room. 

Isa49 

In  the  room  is  a  location. 
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John  is  happy  that  John  has  the  S250000. 

John  has  the  $250000. 

John  and  Fred  carried  out  a  bargain  in  which  John  released  Mary  and 
Fred  gave  John  the  S250000  at  Trenos. 

John  wants  to  have  the  $250000  more  than  he  wants  Mary  to  be  John's  captive. 
Fred  believed  that  Mary  was  John's  captive. 

John  contacted  Fred  and  told  him  that  Mary  was  John's  captive. 

John  called  Fred  and  told  him  that  Mary  was  John's  captive. 

John  knows  that  Fred  has  phone  number  something. 

John  believed  that  Mary  was  John's  captive. 

John  captured  Mary  and  locked  her  in  the  room. 

In  the  room  is  in  the  room. 

Mary  was  f  ree. 

Mary  was  John's  captive. 

Mary  was  not  free. 

Fred  wants  Mary  to  be  free  more  than  he  wants  to  have  the  S250000. 

Fred  has  a  positive  relationship  with  Mary. 

Fred  is  Mary's  parent. 

Fred  is  Mary's  father. 

Fred  had  the  S250000. 

250000  is  less  than  order-millions. 

The  S 25 0000  is  250000  dollars  of  money. 

Fred  has  millions  of  dollars. 

Millions  of  dollars  is  order-millions  dollars  of  money. 

Mary  is  f  ree. 

Fred  does  not  have  the  S250000. 

Mary  is  not  John's  captive. 

Fred  does  not  believe  that  Mary  is  John's  captive. 

John  does  not  believe  that  Mary  is  John's  captive. 


Figure  B.6:  Kidnap  Example  —  Specific  Explanation  (Highest-Level) 
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John  has  the  S250000. 

John  wants  to  have  the  S250000  more  than  he  wants  Mary  to  be  John's  captive. 
Fred  wants  Mary  to  be  free  more  than  he  wants  to  have  the  S250000. 

Fred  has  a  positive  relationship  with  Mary. 

Fred  is  Mary's  parent. 

Fred  is  Mary's  father. 

Fred  had  the  S250000. 

250000  is  less  than  order-millions. 

Millions  of  dollars  is  money. 

The  S250000  ;s  money. 

The  S250000  is  250000  dollars  of  money. 

Fred  has  millions  of  dollars. 

Millions  of  dollars  is  order-millions  dollars  of  money. 

Mary  was  John's  captive. 

Mary  is  free. 

John  released  Mary. 

John  believes  that  if  John  released  Mary  then  Fred  will  give  John  the 
S250000  at  Trenos. 

Mary  is  not  John's  captive. 

Fred  does  not  have  the  S250000. 

Fred  gave  John  the  S250000  at  Trenos. 

Fred  believes  that  if  Fred  gave  John  the  S250000  at  Trenos  then  John 
will  release  Mary. 

John  told  Fred  that  if  Fred  gave  John  the  $250000  at  Trenos  ".her.  John 
would  release  Mary. 

Figure  B.7:  Kidnap  Example  —  Specific  Explanation  (Bargain) 
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Residencel 

Color  1 

AtlO  Xamel  Name*! 

Gender  1 

V 

Residence2 

Attire  1 

At27  Name2  Isa.59 

Gender2 

V 

Isa  3  3 

Isa  3d 

Isa3S  Name3  Isa60 

Gender3 

V 

/i 

Isa  3  3 

The  jeans  are  jeans. 

Isa35 

In  the  car  is  a  location. 

lsa3S 

The  hotel  is  a  hotel. 

s' 

Isa59 

Trenos  is  a  restaurant. 

lsa60 

Trenos  is  a  location. 

Gender 1 

Fred  is  male. 

K 

Gender2 

Mary  is  female. 

S  « 
si 

Gender3 

John  is  male. 

Xamel 

Fred's  name  is  Fred. 

-T* 

\ame2 

Mary’s  name  is  Mary. 

A 

A 

Yame3 

John's  name  is  John. 

\ame4 

Trenos's  name  is  Trenos. 

. 

AtlO 

The  hotel  is  at  the  car. 

V 

At27 

Trenos  is  at  some  place. 

. 

Color  1 

The  jeans  are  blue. 

Attire  1 

Mary  is  wearing  the  jeans. 

t  * 

Residencel 

John  lives  in  the  hotel. 

l< 

Residence2 

John  lives  in  the  room. 

Figure  B.8:  Unconnected  Facts  in  the  Kidnap  Example 


other  actions.  For  example,  if  a  particular  instance  of  a  Bargain  involves  actor  X  doing  action  A  in 
exchange  for  actor  Y  doing  action  B.  then  the  effects  and  preconditions  of  the  overall  Bargain 
include  the  effects  and  preconditions,  respectively,  of  actions  A  and  B.  In  addition,  in  order  for  the 
Bargain  to  be  motivated.  X  should  value  the  facts  added  by  action  B  over  those  deleted  by  action  A. 
and  Y  should  value  the  facts  added  by  A  over  those  deleted  by  B.  For  further  information,  the 
interested  reader  is  referred  to  the  complete  definitions  of  Bargain  and  Threaten  given  in  section  1 
of  appendix  C. 


B.1.3.  Learning  CaptureBargain 

After  the  Bargain  schema  is  determined  in  the  kidnapping  narrative,  the  system  detects  that 
John  has  achieved  a  thematic  goal  of  possessing  a  valuable  item.  Since  this  goal  is  achieved  by 
novel  combination  of  volitional  actions  by  the  main  character,  the  system  proceeds  to  learn  a  new 
schema  from  ns  explanation  of  how  the  goal  was  achieved.  Prior  to  generalization,  the  GEYES1S 


pruning  algorithm  removes  a  number  of  units  from  the  explanation  shown  in  Figure  B.6.  In  addi¬ 
tion  to  pruning  a  few  Isa  inferences,  a  number  of  interesting  generalizations  are  made.  The  fact 
that  Telephone  was  the  particular  specialization  of  Communicate  used  in  the  explanation  is 
removed  since  all  of  its  effects  that  support  the  goal  are  inherited  from  Communicate.  Also,  the 
abstraction  inferences  from  Father  to  Parent  and  from  Parent  to  PositivelPT  are  also  removed  since 
the  Father  and  Parent  relationships  only  support  the  goal  through  the  more  abstract  PositivelPT 
relationship.  The  final  generalized  explanation  EGGS  produces  is  shown  in  Figure  B.9. 

The  packaging  of  this  generalized  explanation  into  a  schema  does  not  detect  any  protection 
violations  and  results  in  the  final  definition  shown  in  Figure  B.10.  The  English  summary  the  para- 
phraser  produces  for  this  schema  was  presented  in  chapter  2.  English  translations  of  the  sugges¬ 
tions  used  to  index  this  new  schema  were  also  given  in  the  trace  in  chapter  2.  The  formal  versions 
of  these  suggestions  are  given  below: 

((Capture  7x55  ?b9  ?c4  ?r5)  —  *'->  (CaptureBargain  °x55  7a34  ?b9  ?c4  ?r5  ?v5  ?1 1 1 )) 
((Communicate  7x55  ?a34  (Captive  ?b9  7x55  ?c4  7r5))  ~> 

(CaptureBargain  7x55  ?a34  ?b9  ?c4  ?r5  7v5  7111)) 

((Bargain  ?xf.5  ?a34  (Release  7x55  ?b9  7c4  ?r5)  (Atrans  ?a34  °v5  7x55  ?lll))) 

(CaptureBargain  7x55  ?a34  ?b9  7c4  ?r5  7v5  ?lll)) 

B.1.4.  Processing  the  Test  Narrative  After  Learning 

When  GENESIS  processes  the  test  kidnapping  narrative  after  learning  the  CaptureBargain 
schema,  the  Capture  schema  determined  from  Bob's  locking  Alice  in  his  basement  suggests  that  the 
new  schema  may  be  useful  in  understanding  the  narrative.  Bob's  acquisition  of  S75.00  and  Alice's 
restored  freedom  confirms  all  of  the  positive  effects  of  CaptureBargain  and  the  schema  is  deter¬ 
mined.  The  resulting  explanation  for  Bob’s  possession  of  the  money  is  shown  in  Figure  B.ll.  This 
explanation  allows  the  system  to  answer  the  questions  it  could  not  answer  previously  and  the  Cap¬ 
tureBargain  schema  provides  a  reasonable  paraphrase  of  the  text. 

Other  previously  incomprehensible  narratives  can  also  be  understood  using  the  new  schema. 
An  interesting  variation  of  the  test  narrative  that  results  in  the  same  final  explanation  is  the  even 
sketchier  account  shown  below: 

Ted  is  Alice's  husband.  He  won  a  S100000  in  the  lottery.  Someone  imprisoned  Alice  in  a  base¬ 
ment.  Bob  got  S75000.  Someone  released  Alice. 
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V. 
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Possess  6 
GoalPriorityJ 
Captivel 

Communicatcl  — *■  Bel 


Bargainl 


ThetneGoalMe’2 


ThemeGoalMet2 

7x55  is  happy  that  7x55  has  ?v5. 

Isa6S 

7v5  is  a  valuable. 

Possess  7 

7x55  has  ?y5. 

:S 

Bargainl 

?x55  and  ?a34  carried  out  a  bargain  in  which  ?x55  released  ?b9  and 
?a34  gave  7x55  ?v5  at  ?111. 

-f 

GoalPrioritv2 

7x55  wants  to  have  ?y 5  more  than  he-'she  wants  ?b9  to  be  7x55‘s  captive. 

Isa69 

7a34  is  a  character. 

Isa  70 

7x55  is  a  character. 

Beiieve5 

?a34  believed  that  ?b9  was  ?x55‘s  captive. 

- 

Communicatel 

7x55  contacted  ?a34  and  told  him/her  that  ?b9  was  ^xSS  s  captive. 

* .. 

Believe2 

7x55  believed  that  ?b9  was  7x55’ s  captive. 

Capture 1 

7x55  captured  7b9  and  locked  him/her  in  0r 5. 

In4 

?c4  is  in  ?r5. 

ti 

Isa53 

7r5  is  a  room. 

Isa52 

?c4  is  a  location. 

Isa51 

?b9  is  a  person. 

'-r. 

Freel 

°b9  was  free. 

Captivel 

?b9  was  ?x55's  captive. 

Not  13 

?b9  was  not  free. 

1 

GoaIPrioritv3 

7a34  wants  7b9  to  be  free  more  than  he/'she  wants  to  have  ?v5. 

V  , 

PositiveIPT2 

?a34  has  a  positive  relationship  with  7b9. 

Possess6 

°a34  had  ?v5. 

£ 

Free3 

7b9  is  f  ree. 

< 

Not  15 

?a34  does  not  have  ?y5. 

Not  16 

°b9  is  not  7x55’s  captive. 

Jr 

Not  17 

7a34  does  not  believe  that  ?b9  is  7x55's  captive. 

Not  18 

7x55  does  not  believe  that  ?b9  is  7x55's  captive. 

.v 

.v 

Figure  B.9:  Kidnap  Example  —  Generalized  Explanation 
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*(CaptureBargain  7x55  °a34  ?b9  ?c4  ?r5  °v5  ?lll)  "'’xSS  kidnapped  ?b9  M 
(Constraint  (Isa  ?b9  Person)  (Isa  ?c4  Location)  (Isa  ?r5  Room)  (In  ?c4  7r5) 

(Isa  7x55  Character)  (Isa  ?a34  Character)  (Isa  ?y 5  Valuable)) 

(Precondition  (Possess  ?a34  ?y5)  (PositivelPT  ?a34  ?b9)  (Free  ?b9)) 

(Motivation  (GoalPrioritv  7x~55  (Possess  7x55  ?v5)  (Captive  ?b9  7x55  ?c4  ?r5))) 

(Effect  (Not  (Believe  7x55  (Captive  ?b9  7x55  ?c4  ?r5))) 

(Not  (Believe  ?a34  (Captive  ?b9  7x55  ?c4  7r5)))  (Not  (Captive  ?b9  7x55  7c4  ?r5)) 

(Not  (Possess  9a34  ?y5))  (Free  ?b9)  (Possess  ’’xSS  ?v5)) 

(Subaction  (Capture  7x55  ?b9  ^4  ?r5)  (Communicate  ^xSS  ?a34  (Captive  ?b9  '1x55  ?c4  ?r 5)) 
(Bargain  '’xSS  ?a34  (Release  0x55  ?b9  ?c4  ?r5)  (Atrans  ?a34  ?v5  7x55  ?lll))) 

(Internal  (Not  (Free  ?b9))  (Captive  ?b9  7x55  ?c4  ?r5) 

(GoalPrioritv  9a34  (Free  ?b9)  (Possess  ?a34  ?v5)) 

(Believe  9a34  (Captive  ?b9  7x55  ?c4  ?r5))  (Believe  7x55  (Captive  ^9  7x55  ?c4  ?r5))) 
(Links  (Antecedent  (Internal  3)  (Precondition  2))  (Precondition  (Subaction  1)  (Precondition  3)) 
(Constraint  (Subaction  l)  (Constraint  4))  (Constraint  (Subaction  l)  (Constraint  3)) 
(Constraint  (Subaction  l)  (Constraint  2))  (Constraint  (Subaction  1)  (Constraint  1)) 
(Constraint  (Subaction  l)  (Constraint  5))  (Effect  (Subaction  1)  (Internal  5 )) 

(Effect  (Subaction  l)  (Internal  1))  (Effect  (Subaction  l)  (Internal  2)) 

(Precondition  (Subaction  2)  (Internal  5))  (Constraint  (Subaction  2)  (Constraint  6)) 
(Constraint  (Subaction  2)  (Constraint  5))  (Effect  (Subaction  2)  (Internal  4)) 

(Precondition  (Subaction  3)  (Internal  4))  (Precondition  (Subaction  3)  (Internal  3)) 
(Precondition  (Subaction  3)  (Precondition  1))  (Precondition  (Subaction  3)  (Internal  2)) 
(Motivation  (Subaction  3)  (Motivation  1))  (Constraint  (Subaction  3)  (Constraint  6)) 
(Constraint  (Subaction  3)  (Constraint  5))  (Effect  (Subaction  3)  (Effect  1)) 

(Effect  (Subaction  3)  (Effect  2))  (Effect  (Subaction  3)  (Effect  3)) 

(Effect  (Subaction  3)  (Effect  4))  (Effect  (Subaction  3)  (Effect  5)) 

(Effect  (Subaction  3)  (Effect  6))) 

Figure  B.10:  CaptureBargain  Schema  Definition 
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Isa35 

The  $75000  is  a  valuable. 

Isa  34 

The  $75000  is  money. 

Isal 

Ted  is  a  person. 

Isal4 

Ted  is  a  character. 

Isa  17 

Bob  is  a  person. 

Isa20 

Bob  is  a  character. 

lsa22 

The  basement  is  a  room. 

Isa  18 

The  basement  is  a  basement. 

Isa  19 

In  the  basement  is  a  location. 

!sa2 

Alice  is  a  person. 

Isa8 

The  lottery  is  a  game. 

Isa4 

The  lottery  is  a  lottery. 

Isa3 

The  $100000  is  money. 

Isa  10 

The  S100000  is  an  inanimate  object. 

£ 

Isa  12 

The  S100000  is  a  valuable. 

ThemeGoalMet3 
Possess5 
Bargain  1 

GoalPrioritv3 

Captive  1 

Capturel 

Ini 

Freel 

Notl 

Believe! 

PossessS 

LessThanl 

Amount2 

Possess2 

W'inl 

A  mount  1 

GoalPriority2 

PositivelPTl 

Spouse  1 

Husband  1 

Believe3 

Communicatel 

Not  3 

NoU 

NotS 

Not6 

Free  3 


Bob  is  happy  that  Bob  has  the  S75000. 

Bob  has  the  $75000. 

Bob  and  Ted  carried  out  a  bargain  in  which  Bob  released  Alice  and 
Ted  gave  Bob  the  $75000. 

Bob  wants  to  have  the  S75000  more  than  he  wants  Alice  to  be  Bob’s  captive. 
Alice  was  Bob’s  captive. 

Bob  captured  Alice  and  locked  her  in  the  basement. 

In  the  basement  is  in  the  basement. 

Alice  is  f  ree. 

Alice  was  not  free. 

Bob  believed  that  Alice  was  Bob’s  captive. 

Ted  had  the  S75000. 

75000  is  less  than  100000. 

The  S75000  is  75000  dollars  of  money. 

Ted  has  the  $100000. 

Ted  won  the  S100000  in  the  lottery. 

The  $100000  is  100000  dollars  of  money. 

Ted  wants  Alice  to  be  free  more  than  he  wants  to  have  the  S75000. 

Ted  has  a  positive  relationship  with  Alice. 

Ted  is  Alice's  spouse. 

Ted  is  Alice's  husband. 

Ted  believed  that  Alice  was  Bob's  captive. 

Bob  contacted  Ted  and  told  him  that  Alice  was  Bob  s  captive. 

Bob  does  not  believe  that  Alice  is  Bob's  captive. 

Ted  does  not  believe  that  Alice  is  Bob's  captive. 

Alice  is  not  Bob's  captive. 

Ted  does  not  have  the  S75000. 

Alice  is  f  ree. 


£ 

■f. 


Figure  B.ll:  Kidnap  Test  Example  After  Learning  —  Specific  Explanation 


The  processing  of  this  narrative  using  CaptureBargain  is  basically  the  same  as  the  processing  of  the 
test  narrative  except  for  the  fact  that  Bob  being  the  one  who  got  the  money  identifies  him  as  the 
kidnapper.  Consequently,  if  after  processing  this  narrative  GEXESIS  is  asked:  "Who  captured 
Alice?"  it  replies:  "Bob  captured  Alice  and  locked  her  in  the  basement." 

B.2.  The  Arson  Example 

In  the  Arson  example.  GEXESIS  learns  a  schema  for  an  individual  burning  his  own  property 
in  order  to  collect  the  insurance  money.  A  version  of  this  example  was  first  used  in  the  original 
QT-'—ig  system  [MocneyS5b].  Before  processing  the  narratives  for  the  Arson  example.  GEXESIS 
has  schemata  for  burning,  insuring  objects,  and  being  indemnified  for  the  loss  of  property:  how¬ 
ever.  it  does  not  have  a  schema  for  arson-for-insurance. 

B.2.1.  Processing  the  Test  Narrative  Before  Learning 

In  order  to  demonstrate  the  system's  initial  ignorance  of  arson-for-insurance.  GEXESIS  is 
given  the  following  test  narrative: 

Input:  John  owned  a  barn.  He  burned  it.  He  got  40000  dollars. 


Readv  for  Questions: 


>  Summarize 
John  burned  a  barn. 


>  How  did  John  get  the  money0 
Cannot  find  sufficient  cause. 

>  Why  did  John  burn  the  barn? 
Cannot  find  sufficient  reason. 


>  Who  owned  the  barn0 
John  has  the  barn. 

The  parsed  version  of  this  narrative  is  shown  in  Figure  B.12  and  the  complete  causal  model  the  sys¬ 
tem  builds  for  this  narrative  is  shown  in  Figure  B.  1 3 .  Without  a  schema  for  this  situation  or  any 
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(Isa  Person25  Person)  (Gender  Person25  male)  (.Name  Person25  John) 

'Isa  Barr.l  Barn)  (Possess  Person25  Barnl)  (Burn  Person25  Barnl) 

(Isa  MonevO  Money)  (Amount  Monev5  40000  dollar)  (Possess  Person25  MonevS) 


Figure  B.12:  Parsed  Version  of  the  Arson  Test  Narrative 
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Isa7  — Isa5 
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Gender  1  Possessl 


Believe2 

Burnt2 


Possess2 

Amountl 


Burnt2 

Burnl 

Isa5 

Isa  7 

Isa2 

Isal 

Not2 

Flammable2 

Believe2 

Isa9 

Gender  1 
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Possessl 

Possess2 

Amountl 


The  barn  is  burnt. 

John  burned  the  barn. 

The  barn  is  an  inanimate  object. 

The  barn  is  a  building. 

The  barn  is  a  barn. 

John  is  a  person. 

The  barn  was  not  burnt. 

The  barn  is  flammable. 

John  believes  that  the  barn  is  burnt. 
The  S40000  is  money. 

John  is  male. 

John's  name  is  John. 

John  has  the  barn. 

John  has  the  $40000. 

The  S40000  is  40000  dollars  of  money. 


Figure  B.13:  Causal  Model  for  Arson  Test  Example  Before  Learning 


mention  of  insurance  or  indemnity  in  the  text,  the  system  is  unable  to  construct  a  satisfactory 
causai  model:  and  John's  possession  of  the  money  is  left  unconnected  to  his  act  of  burning  his  barn. 
Consequently,  the  system  is  unable  to  answer  the  questions  presented  in  the  trace. 

B.2.2.  Processing  the  Learning  Narrative 

Next,  the  system  is  given  a  more  detailed  narrative  about  arson-for-insurance.  which  it  can 
explain  and  generalize  into  a  schema. 
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Input  Stan  owned  a  warehouse.  He  insured  it  against  hre  for  100000  dollars  Stan  burned  the 
warehouse.  He  called  Prudential  and  told  them  it  was  burnt.  Prudential  paid  him  100000  dol- 


Thematic  goal  achieved:  Stan  is  happy  that  Stan  is  insured  with  the  insurance  company  for  the 
S100000  if  the  warehouse  is  burnt. 

Thematic  goal  achieved:  Stan  is  happy  that  Stan  has  the  S100000 
Explanation  suitable  for  generalization.  Pruning. ..Generalizing. ..Packaging... 

Creating  New  Schema:  (InsureObjectBurnlndemmfy  7a3  7o2  ?c3  7v3  714) 


°a3  is  a  person.  ?o2  is  an  inanimate  object.  7c3  is  an  insurance  company.  ?v3  is  money.  ?o2  is 
not  burnt.  7a3  has  ?c2.  ?a3  insures  °o2  with  °c3  for  ?v3  in  case  it  is  burnt.  ?o2  is  flammable. 
7a3  burns  ?o2.  7a3  contacts  7c3  and  tells  it  that  ?o2  is  burnt.  7c3  indemnifies  ?a3  7v3  for  the 
loss  of  7o2. 


Having:  7a3  insured  ?o2  with  7c3  for  ?v3  in  case  it  is  burnt,  suggest  InsureObjectBurnlndemmfy 
Having:  7a3  burned  ?o2.  suggest  InsureObjectBurnlndemmfy 

Having:  °a3  contacted  ?c3  and  told  it  that  ?o2  was  burnt,  suggest  InsureObjectBurnlndemmfy 
Ready  for  questions: 

>  Summarize 

A  warehouse  was  not  burnt.  Stan  had  the  warehouse.  Stan  insured  the  warehouse  with  Pru¬ 
dential  for  S10Q000  in  case  it  is  burnt.  The  warehouse  was  flammable.  Stan  burned  the  ware¬ 
house.  Stan  contacted  Prudential  and  told  them  that  the  warehouse  was  burnt.  Prudential 
indemnified  Stan  the  $100000  for  the  loss  of  the  warehouse. 

>Why  did  Stan  insure  the  warehouse? 

So  Prudential  would  indemnify  Stan  the  S 100000  for  the  loss  of  the  warehouse. 

>  Why  did  Stan  burn  the  wharehouse7 
= WAREHOUSE1 
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Sc  Stan  could  call  Prudential  ana  could  tell  them  that  the  warehouse  was  burnt. 


>  Why  did  Stan  tell  Prudential  the  warehouse  was  burnt0 

So  Prudential  would  indemnify  Stan  the  SlOOOOO  for  the  loss  of  the  warehouse. 


>  Why  did  Prudential  pay  Stan  the  money? 

Because  Prudential  believed  that  the  warehouse  was  burnt  and  because  Stan  was  insured  with 
Prudential  for  the  SlOOOOO  if  the  warehouse  was  burnt. 


>  Why  did  Prudential  believe  the  warehouse  was  burnt? 

Because  Stan  called  Prudential  and  told  them  that  the  warehouse  was  burnt. 

The  parsed  version  of  this  narrative  is  shown  in  Figure  B.14  and  the  explanation  the  system  con¬ 
structs  for  why  Stan  has  the  money  is  shown  in  Figure  B.15.  This  explanation  can  be  summarized 
as  follows.  Stan  insuring  the  warehouse  has  the  effect  of  making  it  insured  and  Stan  burning  the 
barn  has  the  effect  of  making  it  burnt.  When  Stan  calls  Prudential  it  suggests  a  Telephone  schema, 
which  is  determined  when  Stan  tells  them  that  the  barn  is  burnt.  Prudential's  action  of  giving  Stan 
SI 00.000  is  recognized  as  an  instance.of  the  Indemnify  schema.  Since  Indemnify  is  a  specialization 
of  Atrans  (see  the  GENESIS  Action  hierarchy  in  chapter  9)  and  since  the  actor  of  an  Indemnify  is 
constrained  to  be  an  insurance  company  (which  Prudential  is  known  to  be),  the  Atrans  is  special¬ 
ized  to  an  Indemnify.  This  transformation  makes  use  of  one  of  Lvtinen’s  [Lytiner.S4]  parsing  rules. 


(Isa  Person26  Person)  (Gender  Person26  male)  (Name  Person26  Stan)  (Isa  Warehousel  Warehouse) 
(Possess  Person26  Warehousel)  (Isa  Money6  Money)  (Amount  Monev6  1 00000  dollar) 
(InsureObject  Person26  (Burnt  Warehousel)  Money6  7COMPANY2)  (Bum  Person26  Warehousel) 
y  (Isa  InsuranceCol  InsuranceCo)  (Name  InsuranceCol  Prudential) 

(DialTelephone  Person26  InsuranceCol  ?NUMBER135) 
v.  (Mtrans  Person26  (Burnt  Warehousel)  InsuranceCol) 
y.)  (Atrans  InsuranceCol  Monev6  Person26  ?ATS5) 

Figure  B.14:  Parsed  Version  of  the  Arson  Learning  Narrative 


‘The  1NTERLISP  spelling  corrector  [TeilelnanSi]  is  usee  to  correct  such  typos. 


!sa2 


Genderl  Narael  Name2  Amountl 


ThemeGoalMet! 

Isa34 

lsa3 

Possess6 

Atransl 

Indemnify! 

Insured2 

InsureObjec:  1 

Isa  17 

Isa9 

Isal  1 

Isa2 

Isa  13 

isal 

Not2 

Possess  1 

Believed 

Communicate! 

Telephone! 

Know! 

Isa!8 
Isa20 
Believe2 
Burn  I 

Fiammable2 

Burnt! 

Isa3! 

Isa32 

Possess" 

Not6 
Gender  1 
Name! 

Name! 

Amount  I 


Stan  is  happy  that  Stan  has  the  S100000. 

The  SIOOOOO  is  a  valuable. 

The  SIOOOOO  is  money. 

Stan  has  the  SIOOOOO. 

Prudential  gave  Stan  the  SIOOOOO. 

Prudential  indemnified  Stan  the  SIOOOOO  for  the  loss  of  the  warehouse. 

Stan  is  insured  with  Prudential  for  the  SIOOOOO  if  the  warehouse  is  burnt. 

Stan  insured  the  warehouse  with  Prudential  for  the  SIOOOOO  in  case  it  is  burnt. 
Prudential  is  an  insurance  company. 

The  warehouse  is  an  inanimate  object. 

The  warehouse  is  a  building. 

The  warehouse  is  a  warehouse. 

Stan  is  a  character. 

Stan  is  a  person. 

The  warehouse  was  not  burnt. 

Stan  has  the  warehouse. 

Prudential  believes  that  the  warehouse  is  burnt. 

Stan  contacted  Prudential  and  told  them  that  the  warehouse  was  burnt. 

Stan  called  Prudential  and  told  them  that  the  warehouse  was  burnt. 

Stan  knows  that  Prudential  has  phone  number  something. 

Prudential  is  a  character. 

Prudential  is  a  company. 

Stan  believes  that  the  warehouse  is  burnt. 

Stan  burned  the  warehouse. 

The  warehouse  is  flammable. 

The  warehouse  is  burnt. 

Some  place  is  a  location. 

The  SIOOOOO  is  an  inanimate  object. 

Prudential  had  the  SIOOOOO. 

Prudential  does  not  have  the  SIOOOOO. 

Stan  is  male. 

Stan's  name  is  Stan. 

Prudential's  name  is  Prudential. 

The  SIOOOOO  is  100000  dollars  of  money. 


Figure  B.15:  Arson  Example  —  Specific  Explanation 
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the  original  definition  of  which  is  given  below: 

SLOT-FILLER-SPECIALIZATION  RULE.  If  a  slot  cf  concept  A  is  filled  by  concept  B.  and  B  is 
the  prototypical  filler  lor  that  slot  of  concept  C  and  concept  C  ISA-A  concept  A.  then  change 
the  representation  of  concept  A  to  concept  C  (p.224). 

In  the  example.  A  is  the  Atrans.  B  is  InsuranceCo.  and  C  is  Indemnify.  The  preconditions  and 
motivations  of  the  Indemnify  include  that  Prudential  believe  that  the  warehouse  is  burnt  (which 
matches  an  effect  of  Stan's  Telephone  action)  and  that  the  warehouse  is  insured  against  fire  for 
$100,000  (which  matches  an  effect  of  Stan's  InsureObject  action).  The  resulting  explanation  as 
shown  in  Figure  B.15  allows  GENESIS  to  answer  the  questions  about  this  narrative  presented  in  the 


B.2.3.  Learning  InsureObjectBurnlndemnify 

After  the  effects  of  the  Indemnify  action  are  added.  GENESIS  detects  that  Stan  has  achieved 
the  thematic  goal  of  possessing  a  valuable  item  in  a  novel  way.  The  only  action  not  volitionallv 
performed  by  Stan  (the  main  character)  is  Prudential’s  Indemnify  action  and  this  action  is 
motivated  by  the  fact  that  they  believe  the  warehouse  is  burnt,  which  is  in  turn  supported  by 
Stan's  Burn  and  Communicate  actions.  Consequently  the  explanation  satisfies  all  of  the  learning 
criteria  and  the  system  proceeds  to  learn  a  new  schema  from  its  explanation  for  how  Stan's  goal 
was  achieved.  Prior  to  generalization,  several  units  are  pruned  from  the  explanation  shown  in  Fig¬ 
ure  B.15.  In  addition  to  pruning  the  fact  that  the  burned  item  is  a  warehouse,  the  specialization  of 
Communicate  to  Telephone  is  also  pruned  as  it  was  in  the  kidnapping  example.  The  final  general¬ 
ized  explanation  EGGS  produces  for  this  example  is  shown  in  Figure  B.16. 

The  packaging  of  this  generalized  explanation  into  a  schema  detects  an  interesting  protection 
violation  and  imposes  an  additional  ordering  constraint  to  prevent  it.  The  system  detects  that  the 
Burn  action  can  potentially  clobber  a  precondition  of  the  InsureObject  action,  namely  that  the  item 
being  insured  cannot  already  be  m  the  state  that  it  is  being  insured  against  (being  burnt  in  this 
case).  As  is  clear  from  the  graph  in  Figure  B.l6,  the  structure  of  the  explanation  itself  does  not 
impose  a  temporal  ordering  on  these  two  actions.  To  avoid  the  violation,  the  system  adds  the  foi- 
iowirg  constraint  to  the  new  schema: 

I.nsureObjectl  should  proceed  Burnl  when  T  or  else  Burnt2  will  clobber  N’ot7. 

The  final  definition  for  the  schema  learned  from  this  example  is  given  in  Figure  B.17. 
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°a3  is  happy  that  7a3  has  ?v3. 

lsa34 

?v3  is  a  valuable. 

Isa37 

7v3  is  money. 

Possess6 

°a3  has  9v3. 

Atransl 

9c3  gave  ?a3  7v3  at  ?14. 

Indemnify  1 

7c3  indemnified  ?a3  7v3  for  the  loss  of  7o2. 

Insured2 

7a3  is  insured  with  ?c3  for  °v3  if  7o2  is  burnt. 

InsureObjectl 

7a3  insured  ?o2  with  °c3  for  ?v3  in  case  it  is  burnt. 

Isa3S 

?c3  is  an  insurance  company. 

Isa40 

7o2  is  an  inanimate  object. 

Isa  13 

9a3  is  a  character. 

Isa39 

?a3  is  a  person. 

Not  7 

7o2  is  not  burnt. 

Pcssess2 

7a3  has  7o2. 

Believe4 

°c3  believes  that  ^02  is  burnt. 

Communicatel 

7a3  contacted  ?c3  and  told  them  that  ?o2  was  burnt 

Isa  IS 

7c3  is  a  character. 

Isa20 

^03  is  a  company. 

Believe2 

°a3  believes  that  7o2  is  burnt. 

Burnl 

7a3  burned  ?o2. 

Flammable  1 

7o2  is  flammable. 

Burnt2 

no2  is  burnt. 

Isa30 

714  is  a  location. 

Isa32 

7v3  is  an  inanimate  object. 

Possess' 

7c3  had  9v3. 

Noi6 

7c3  does  not  have  "v3. 
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Figure  B.I6:  Arson  Example  —  Generalized  Explanation 
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UlnsureObjectBurnlndemnifv  ?a3  9o2  9c3  °v3  ?14)  "?a3  did  something  to  ?o2.“ 

i  Constraint  (Isa  ?14  Location)  (Isa  9a3  Person)  (Isa  ?o2  Inanimate) 

(Isa  ?c3  InsuranceCo)  (Isa  ?v3  Money)) 

(Precondition  (Possess  ?c3  ?v3)  (Flammable  ?o2)  (Possess  ?a3  9o2)  (Not  (Burnt  ?o2))) 

(Effect  (Insured  (Burnt  ?o2)  9a3  ?v3  ?c3)  (Believe  ?a3  (Burnt  ?o2))  (Burnt  ?o2) 

(Believe  ?c3  (Burnt  ?o2))  (Not  (Possess  ?c3  ?v3))  (Possess  ?a3  ?v3)) 

(Subaction  (InsureObject  ?a3  (Burnt  ?o2)  9v3  ?c3)  (Burn  9a3  ?o2) 

(Communicate  ?a3  9c3  (Burnt  ?o2))  (Indemnify  ?c3  ?v3  ?a3  (Burnt  ?o2)) 

(Atrans  ?c3  ?v3  ?a3  ?14 )) 

(Internal  (Isa  ?v3  Inanimate)  (Isa  9c3  Character)  (Isa  ?c3  Company)  (Isa  ?a3  Character! 

(Isa  ?v3  Valuable)) 

(Links  (Antecedent  (Internal  l)  (Internal  5))  (Antecedent  (Internal  2)  (Internal  3)) 

(Antecedent  (Internal  3)  (Constraint  4))  (Antecedent  (Internal  4)  (Constraint  2)) 
(Antecedent  (Internal  5)  (Constraint  5))  (Precondition  (Subaction  1)  (Precondition  4)) 
(Precondition  (Subaction  l)  (Precondition  3))  (Constraint  (Subaction  1)  (Constraint  4)) 
(Constraint  (Subaction  1)  (Constraint  5))  (Constraint  (Subaction  1)  (Constraint  3)) 
(Constraint  (Subaction  1)  (Internal  4))  (Effect  (Subaction  1)  (Effect  l)) 

(Precondition  (Subaction  2)  (Precondition  4))  (Precondition  (Subaction  2)  (Precondition  2)) 
(Constraint  (Subaction  2)  (Constraint  3))  (Constraint  (Subaction  2)  (Constraint  2)) 

(Effect  (Subaction  2)  (Effect  2))  (Effect  (Subaction  2)  (Effect  3)) 

(Precondition  (Subaction  3)  (Effect  2))  (Constraint  (Subaction  3)  (Internal  2)) 

(Constraint  (Subaction  3)  (Internal  4))  (Effect  (Subaction  3)  (Effect  4)) 

(Motivation  (Subaction  4)  (Effect  1))  (Motivation  (Subaction  4)  (Effect  4)) 

(Constraint  (Subaction  4)  (Constraint  3))  (Constraint  (Subaction  4)  (Constraint  5)) 
(Constraint  (Subaction  4)  (Constraint  4))  (Antecedent  (Subaction  5)  (Subaction  4)) 
(Precondition  (Subaction  5)  (Precondition  1))  (Constraint  (Subaction  5)  (Constraint  l)) 
(Constraint  (Subaction  5)  (Internal  4))  (Constraint  (Subaction  5)  (Internal  l)) 

(Constraint  (Subaction  5)  (Internal  2))  (Effect  (Subaction  5)  (Effect  5)) 

(Effect  (Subaction  5)  (Effect  6))) 

(Ordering  ((Before  (Subaction  l)  (Subaction  2))  T)) 

Figure  B.I7:  InsureObjectBurnlndemnify  Schema  Definition 


B.2.4.  Processing  the  Test  Narrative  After  Learning 

Below  is  the  trace  produced  by  GENESIS  when  processing  the  Arson  test  narrative  for  the 
second  time: 

Input:  John  owned  a  barn.  He  burned  it.  He  got  40000  dollars. 

Thematic  goal  achieved:  John  is  happy  that  John  is  insured  with  the  insurance  company  for  the 
S40000  if  the  barn  is  burnt. 

Thematic  goal  achieved:  John  is  happy  that  John  has  the  S4000G. 


NS 


A  barn  was  not  burnt.  John  had  the  barn.  John  insured  the  barn  with  an  insurance  compar.v 
for  S40000  in  case  it  is  burnt.  The  barn  was  flammable.  John  burned  the  barn.  John  contacted 
the  insurance  company  and  told  them  that  the  barn  was  burnt.  The  insurance  company 
indemnified  John  the  $40000  for  the  loss  of  the  barn. 

>  How  did  John  get  the  money0 

The  insurance  company  indemnified  John  the  S40000  for  the  loss  of  the  barn. 

>  Why  did  the  company  pay  John  the  money? 

Because  John  was  insured  with  the  insurance  company  for  the  S40000  if  the  barn  was  burnt 
and  because  the  insurance  company  believed  that  the  barn  was  burnt. 

>  Why  did  John  burn  the  barn0 

So  John  couid  contact  the  insurance  company  and  could  tell  them  that  the  barn  was  burnt. 
>Why  did  John  insure  the  barn 

Sc  the  insurance  company  would  indemnify  John  the  S40000  for  the  loss  of  the  barn. 

>  Why  did  the  company  believe  the  barn  was  burnt? 

Because  John  contacted  the  insurance  company  and  told  them  that  the  barn  was  burnt. 

his  time.  John's  action  of  burning  his  barn  suggests  the  InsureObjectBurnlndemmf v  schema. 

.  hich  is  determined  when  its  effect  that  John  acquires  money  is  confirmed.  The  explanation  con¬ 
ducted  :or  why  John  has  the  money  is  shown  in  Figure  B.1S.  This  explanation  allows  it  to 
nswer  questions  it  'was  previously  unable  to  answer  and  the  new  schema  provides  a  reasonable 
araphrase  of  the  narrative. 

It  is  also  interesting  to  look  at  some  variations  of  the  test  narrative  that  either  do  or  do  not 
u.  eke  the  InsureObjectBurnlndemmfy  schema.  For  example,  the  following  narrative  does  not 
yoke  the  new  schema: 
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Isa  17 

Possess4 

Atransl 

Isal3 

I sal  4 

Isa  18 

Isa  12 

Isal5 

Isal 

Isa  19 

Possessh 

Indemnifyl 

3elieve3 

Communicatel 

Believe2 

Burnl 

Isa3 

Isai 

Isal 

Flammable! 

Not3 

Burnt! 

Insured  1 
InsureObjeet  1 
Possess  1 
Not! 


John  is  happy  that  John  has  the  S40000. 

The  S40000  is  a  valuable. 

The  S40000  is  money. 

John  has  the  540000. 

The  insurance  company  gave  John  the  S40000. 

The  insurance  company  is  a  character. 

The  insurance  company  is  a  company. 

The  insurance  company  is  an  insurance  company. 

The  540000  is  an  inanimate  object. 

John  is  a  character. 

John  is  a  person. 

Some  place  is  a  location. 

The  insurance  company  had  the  S40000. 

The  insurance  company  indemnified  John  the  S40000  for  the  loss  of  the  barn. 

The  insurance  company  believes  that  the  barn  is  burnt. 

John  contacted  the  insurance  company  and  told  them  that  the  barn  »  as  burnt. 

John  believes  that  the  barn  is  burnt. 

John  burned  the  barn. 

The  barn  is  an  inanimate  object. 

The  barn  is  a  building. 

The  barn  is  a  barn. 

The  barn  is  flammable. 

The  barn  vas  not  burnt. 

The  barn  is  burnt. 

John  is  insured  vith  the  insurance  company  for  the  S40000  if  the  barn  is  burnt. 

John  insured  the  barn  u-ith  the  insurance  company  for  the  540000  in  case  it  is  burnt. 
John  has  the  barn. 

The  insurance  company  does  not  have  the  S40000. 


Figure  B.18:  Arson  Test  Example  After  Learning  —  Specific  Explanation 


Input:  John  owned  a  barn.  He  insured  it  against  fire  for  100000  dollars.  Tne  barn  is  burnt.  All¬ 


state  gave  John  100000  dollars. 


Thematic  goal  achieved:  John  is  happy  that  John  is  insured  with  the  insurance  company  for  the 
S100000  if  the  barn  is  burnt. 

Thematic  goal  achieved:  John  is  happy  that  John  has  the  SI 00000. 

Ready  for  questions: 

>  Why  is  the  barn  burnt? 

Cannot  find  sufficient  reason. 

>  Who  burned  the  barn? 

Question  refers  to  unknown  event,  person,  place,  or  thing. 

>  Summarize 

John  insured  a  barn  with  Allstate  for  S100000  in  case  it  is  burnt. 

Allstate  indemnified  John  the  S100000  for  the  loss  of  the  barn. 

Although  the  action  of  insuring  an  item  against  fire  was  made  to  suggest  the  new  schema,  the  act  of 
insuring  an  item  already  achieves  a  known  thematic  goal  of  wanting  security  against  the  loss  of 
one's  valuable  items.  Consequently,  since  actions  that  are  already  explained  by  a  thematic  goal  are 
not  used  to  suggest  schemata  (see  section  10.2),  InsureObjectBurnlndemnify  is  not  suggested  in  this 
situation. 

The  follow  ing  variation  of  the  test  narrative:  however,  does  invoke  the  new  schema: 

Gene  owned  a  restaurant.  Someone  burned  the  restaurant.  Allstate  gave  Gene  100000  dollars. 

If  after  processing  this  narrative  GENESIS  is  asked:  "Who  burned  the  restaurant?"  it  replies: 
"Gene  burned  the  restaurant."  This  time,  the  fact  that  someone  intentionally  burned  the  restau¬ 
rant  does  suggest  the  new-  arson-for-insurance  schema  since  the  system  has  no  other  explanation  for 
this  action.  The  fact  that  Gene  got  the  money  identifies  him  as  the  agent  and  determines  the 
schema.  Of  course,  since  understanding  is  a  subjective  process,  this  conclusion  could  be  wrong. 
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B.3.  The  Solicit  Example 

Since  all  of  the  other  schemata  GENESIS  has  learned  involve  illegal  means  of  acquiring 
wealth,  it  only  seemed  fair  to  also  give  the  system  the  opportunity  to  learn  a  schema  for  upholding 
the  law.  The  system  was  therefore  given  an  example  from  which  it  could  learn  a  schema  for  a 
police-officer  impersonating  a  prostitute  in  order  to  arrest  potential  customers.  Before  processing 
the  narratives  in  this  example,  the  system  has  schemata  for  prostitution  customers  solicting  sexual 
favors  and  for  police-officers  arresting  law-breakers:  however,  it  does  not  have  a  schema  for  "soli¬ 
citation  entrapment." 

B.3.1.  Processing  the  Test  Narrative  Before  Learning 

First  the  system  is  presented  with  the  following  test  narrative: 

Input:  Alice  was  a  policewoman.  Stan  told  Alice  if  she  had  sex  with  him  then  he  would  give 

her  S75.  Stan  went  to  jail. 

Ready  for  questions: 

>  Summarize 

Stan  solicited  Alice's  sexual  favors  for  S75.  Stan  was  put  in  a  jail. 

>  Why  did  Stan  tell  Alice  if  she  had  sex  with  him  then  he  would  give  her  money? 
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Because  Stan  believed  that  Alice  was  a  prostitute  and  because  Stan  was  horny. 


>  Why  did  Stan  go  to  jail? 

Because  Stan  was  under  arrest  for  something. 


>  Why  was  Stan  under  arrest? 
Cannot  find  sufficient  reason. 


The  parsed  version  of  this  narrative  is  presented  in  Figure  B.19  and  the  causal  model  constructed 
for  the  narrative  is  shown  in  Figure  B.20.  The  fact  that  Stan  asked  Alice  to  have  sex  with  him  for 
money  suggests  and  determines  a  Solicit  schema1  and  the  fact  that  Stan  went  to  jail  suggests  and 
determines  an  Incarcerate  schema.  However,  without  a  schema  for  the  overall  situation  or  anv 
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(Isa  Person34  Person)  (Name  Person34  Alice)  (Gender  Person34  female) 
(Occupation  Person34  police-officer)  (Isa  Person35  Person)  (Gender  Persor.35  male) 
(Name  Person35  Stan)  (Isa  Money  10  Money)  (Amount  MonevlO  75  dollar) 
(Mtrans  Person35  (Implies  (Coitus  Person34  Person35) 

(Atrans  Person35  MonevlO  Person34  ?AT110))  Person34) 
(Isa  Jail  1  Jail)  (Isa  Location?  Location)  (At  Jail  1  Location?) 

(Ptrans  Person35  Perscn35  7FROM1179  Location?) 


Figure  B.19:  Parsed  Version  of  the  Solicit  Test  Example 


explicit  mention  of  the  arrest,  the  system  is  unable  to  connect  Stan’s  incarceration  with  his  solicita¬ 
tion.  The  system  is  therefore  unable  to  appropriately  answer  some  of  the  questions  shown  in  the 


BJ.2.  Processing  the  Learning  Narrative 


Next  the  system  is  given  the  following  learning  narrative: 


Input:  Jane  is  a  policewoman.  She  dressed  in  a  short  red  skirt  and  went  to  a  corner.  Bob 
approached  the  corner  and  told  her  if  she  had  sex  with  him  then  he  would  give  her  S50.  Jane 
arrested  |3ob  for  soliciting.  Bob  is  Mary's  husband  and  he  told  her  that  Jane  entrapped  him. 


Thematic  goal  achieved:  Jane  is  happy  that  Bob  is  under  arrest  for  soliciting  Jane's  sexual 


favors  for  the  S50. 


Explanation  suitable  for  generalization.  Pruning. ..Generalizing. ..Packaging. .. 
Creating  New  Schema:  (DressSolicit Arrest  ?b9  ?c2  ?s2  ?s4  ?14  ?a9  ?c3) 


?b9  is  not  equal  to  ?a9.  ?b9  is  a  person.  7c2  is  an  apparel.  ?b9  has  ,c2.  ?b9  puts  on  ?c2.  ?s2  is 
a  location.  ?I4  is  a  corner.  ?b9  is  at  ?s2.  ^9  goes  from  ?s2  to  ?14.  ?a9  is  a  person.  ?s4  is  a 
location.  0a9  is  at  ?s4.  ?a9  goes  from  ?s4  to  ?14.  ?c3  is  money.  ?a9  is  horny.  ?c2  is  sexy.  -1a9 
solicits  ?b9's  sexual  favors  for  ?c3.  °b9  is  a  police-officer.  ?b9  arrests  °a9  for  soliciting  l’b9's 


sexual  favors  for  '*c3. 


lT.-.«  Scicit  schema  represents  a  customer  offering  money  for  sexual  favors  as  opposed  to  the  more  common  English  in¬ 
terpretation  of  the  u-orc  '•solicit'’  referring  to  a  prostitute  offering  her  services  for  money. 
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Incarceraiel 

Arrested! 

All 

Isa  14 
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Believe! 
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Stan  is  at  the  jail. 

Stan  was  put  in  the  jail. 

Stan  is  under  arrest  for  something. 

The  jail  is  at  Stan. 

The  jail  is  a  location. 

The  jail  is  a  jail. 

Stan  is  a  person. 

Stan  went  to  the  jail. 

Alice  believes  that  Stan  solicited  Alice's  sexual  favors  for  the  S75. 
Stan  solicited  Alice's  sexual  favors  for  the  $75. 

Stan  is  horny. 

Stan  believes  that  Alice  is  a  prostitute. 

The  $75  is  money. 

Alice  is  a  person. 

Stan  has  a  communication  path  to  Alice. 

Stan  told  Alice  that  if  Alice  had  sex  with  Stan  then  Sian  would  giv 
Alice  the  S75. 

Alice's  name  :s  Alice. 

Stan's  name  is  Stan. 

Alice  is  female. 

Stan  is  male. 

Alice  is  a  police-officer. 

The  $75  is  75  dollars  of  money. 


Figure  B.20:  Causal  Model  for  Solcit  Test  Example  Before  Learning 


Having  b9  pul  on  ?c2.  suggest  DressSohcitArrest 

Having-  °a9  solicited  ",b9's  sexual  favors  for  °c3.  sugges:  DressSohcitArrest 

Having:  °b9  arrested  °a9  for  sciicilir.g  ',b9's  sexual  favors  for  7c3.  suggest  DressSohcitArrest 

Unknown  word  '?b9  entrap  °a9"  refers  to  DressSohcitArrest 

Ready  for  questions: 


>  Summarize 

Jane  had  a  skirt.  Jane  put  on  the  skirt.  Jane  was  at  some  place.  Jane  went  to  a  corner.  Bob 
was  at  some  place.  Bob  went  to  the  corner.  Bob  was  horny  The  skirt  was  sexy.  Bob  solicited 
Jane's  sexual  favors  for  S50.  Jane  was  a  police-officer.  Jane  arrested  Bob  for  soliciting  Jane  s 
sexual  favors  for  the  S50. 


>  Why  did  Jane  put  on  the  short  skirt? 

So  Bob  would  solicit  Jane's  sexual  favors  for  the  S50. 


>  Why  did  Jane  gc  to  the  corner? 

Sc  Bob  wouid  solicit  Jane's  sexual  favors  for  the  S50. 


>  Why  did  Bob  go  to  the  corner? 

So  Bob  could  solicit  Jane's  sexual  favors  for  the  S50. 


>  Why  did  Bob  solicit  Jane's  sexual  favors? 

Because  Bob  believed  that  Jane  was  a  prostitute  and  because  Bob  was  horny. 


>  Why  did  Bob  believe  Jane  was  a  prostitute? 

Because  Bob  went  to  the  corner,  because  Jane  went  to  the  corner,  because  the  skirt  was  short 
and  because  Jane  put  on  the  skirt. 


>  Why  did  Jane  arrest  Bob7 

Because  Jane  believed  that  Bco  solicited  Jane's  sexual  favors  for  the  S50  and  because  Jane  was  a 
nci  ice-officer. 
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>  Why  did  Jane  beiieve  that  Bob  solicited  her  sexual  favors0 
Because  Bob  solicited  Jane's  sexual  favors  for  the  S50. 

The  parsed  version  of  this  narrative  is  shown  in  Figure  B.21  and  the  explanation  constructed  for 
w  hv  Bob  is  under  arrest  is  shown  in  Figure  B.22.  This  explanation  can  be  summarized  as  follows. 
Jane's  actions  of  putting  on  a  short  skirt  and  going  to  the  corner  has  the  effect  of  her  being  on  the 
corner  dressed  in  a  short  skirt.  Bob's  action  of  going  to  the  corner  results  in  a  communication  path 
between  Bob  and  Jane  and  enables  him  to  tell  her  that  he  would  give  her  money  if  she  had  sex  with 
him.  The  latter  action  is  recognized  as  a  Solicit  schema,  which  in  order  to  be  motivated  requires 
that  Bob  believe  that  Jane  is  a  prostitute.  This  belief  is  inferred  from  the  fact  that  she  is  wearing  a 
short  skirt  and  standing  on  a  corner.  Jane's  act  of  arresting  Bob  requires  several  motivations.  She 
must  believe  that  Bob  committed  an  illegal  act  (which  matches  an  effect  of  the  Solicit  since  solicit¬ 
ing  is  known  to  be  illegal)  and  she  must  be  a  police-officer  (which  matches  an  input  fact).  The 
resulting  explanation  as  shown  in  Figure  B.22  allows  GENESIS  to  answer  the  questions  about  this 
narrative  presented  in  the  trace 

B.3.3.  Learning  DressSolic  it  Arrest 

After  the  effects  of  the  Arrest  action  are  added.  GENESIS  detects  that  a  police-officer  has 
achieved  her  thematic  goal  of  having  someone  under  arrest.  The  only  action  not  volitionaliv  exe¬ 
cuted  by  Alice  (the  main  character)  is  Bob's  Solicit  action  and  this  action  is  motivated  by  the  fact 
that  he  believes  Alice  is  a  prostitute,  which  in  turn  is  supported  by  Alice's  actions  of  putting  on  a 


A 

A 


(Isa  Person3S  Person)  (Name  Person3S  Jane)  (Gender  Person38  female) 
(Occupation  Person3S  police-officer)  (Length  Skirtl  short)  ( Color  Skirtl  red) 


(Isa  Skirtl  Skirt)  (Dress  Person38  Skirtl)  (Isa  Cornerl  Corner) 

(Ptrans  Person3S  Perscn38  7FROM11S2  Cornerl)  (Isa  Person39  Person) 

f Gender  Person39  male)  f Name  Person39  Bob)  (Ptrans  Person39  Person39  °FROM120S  Cornerl) 
(Isa  Moneyll  Money)  (Amount  Money  11  50  dollar) 

(Mtrans  Person39  (Implies  (Coitus  Persor.38  Person39) 

(Atrans  Person39  Moneyll  Person3S  °AT134))  Person3S) 

(Arrest  Person3S  Person39  (Solicit  Person39  °SL"BJECT97  °FGR3))  (Isa  Person43  Person) 

(Name  Person43  Mary)  (Gender  Person43  female)  (Husband  Person39  Person43) 

' Mtrans  Person39  (?Actior.2  Person3S  Person39)  Person43) 


Figure  B.21:  Parsed  Version  of  the  Solicit  Narrative 
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Arrest  1 
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Solicit  1 

NeedSex2 
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Jane's  name  is  Jane.  Name2 

Jane  is  female.  Gender2 

The  skirt  is  red.  Amount  1 

The  skirt  is  an  apparel.  Isa2 

Jane  is  a  physical  object.  Isal7 

Jane  is  a  character.  Isal 

Bob  is  a  physical  object.  Isa29 

Bob  is  a  character.  Isa21 


Bob’s  name  is  Bob. 

Bob  is  male. 

The  S50  is  50  dollars  of  money. 
The  skirt  is  a  skirt. 

Jane  is  an  animate  object 
Jane  is  a  person. 

Bob  is  an  animate  object. 

Bob  is  a  person. 


Jane  is  happy  that  Bob  is  under  arrest  for  soliciting  Jane's  sexual  favors  for  the  S50. 
Jane  is  a  police-officer. 

Bob  is  under  arrest  for  soliciting  Jane’s  sexual  favors  for  the  S50. 

Jane  arrested  3ob  for  soliciting  Jane's  sexual  favors  for  the  S50. 

Jane  believes  that  Bob  solicited  Jane's  sexual  favors  for  the  S50. 

Bob  solicited  Jane's  sexual  favors  for  the  $50. 

3ob  is  horny. 

3ob  believes  that  Jane  is  a  prostitute. 

Jane  is  wearing  the  skirt. 

Jane  put  on  the  skirt. 

Jane  has  the  skirt. 

The  skirt  is  sexy. 

The  skirt  is  short. 

Jane  is  at  the  corner. 

Jane  went  to  the  corner. 

The  corner  is  a  location. 

The  corner  is  a  corner. 

Jane  is  a  location. 

Jane  was  at  some  place. 

Jane  is  not  at  some  place. 

Bob  is  at  the  corner. 

Bob  went  to  the  corner. 

3ob  is  a  location. 

3ob  was  at  some  place. 

Bob  is  not  at  some  place. 

The  S50  is  money. 

Bob  has  a  communication  path  to  Jane. 

It  is  illegal  for  Bob  to  solicit  Jane's  sexual  favors  for  the  S50. 

Jane  is  not  equal  to  Bob. 

Figure  B.22:  Solicit  Example  —  Specific  Explanation 
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short  skirl  and  going  to  a  corner.  Since  all  of  the  learning  criteria  are  met.  the  system  proceeds  to 
'earn  a  new  schema  from  its  explanation  of  how  Alice's  goal  u>as  achieved.  Prior  to  generalization, 
the  only  pruning  that  is  performed  is  the  removal  of  the  facts  requiring  that  the  police-officer's 
seductive  clothing  be  a  short  skirt.  These  facts  are  removed  since  the  system  knows  other  ways  of 
inferring  that  an  apparel  is  seductive  (e.g  tight  jeans).  The  generalized  explanation  for  this  example 
is  given  in  Figure  B.23  and  the  final  definition  for  the  learned  schema  is  given  in  Figure  B.24.  The 
observant  reader  will  notice  that  the  police-officer  is  not  constrained  to  be  female.  This  is  because 
the  rule  for  inferring  that  someone  believes  that  a  person  is  a  prostitute  does  not  require  that  per¬ 
son  to  be  female.  If  this  additional  antecedent  were  added  to  the  rule,  the  resulting  schema  would 
be  sexist  and  require  a  female  police-officer. 

In  addition  to  learning  a  new  schema  from  this  narrative.  GENESIS  also  learns  the  meaning 
of  a  new  word.  The  last  sentence  of  the  narrative  causes  it  to  associate  the  unknown  word 
"entrap"  with  the  newly  learned  schema.  This  word  is  learned  in  a  manner  directly  analogous  to 
the  learning  of  the  word  "kidnap"  as  discussed  in  chapter  . 

B.3.4.  Processing  the  Test  Narrative  After  Learning 

Below  is  the  trace  produced  by  GENESIS  w'hen  processing  the  test  narrative  after  having 
'.earned  the  new  schema: 

Input:  Alice  was  a  policewoman.  Stan  told  Alice  if  she  had  sex  with  him  then  he  would  give 
her  S 75.  Stan  went  to  jail. 

Thematic  goal  achieved:  Alice  is  happy  that  Stan  is  under  arrest  for  soliciting  Alice’s  sexual 
favors  for  the  S75. 


V, 

'V. 
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Ready  for  questions: 

>  Summarize 

Alice  had  an  apparel.  Aiice  put  on  the  apparel.  Alice  was  at  some  place.  Alice  went  to  a 
corner.  Stan  was  at  some  place.  Stan  a.  ent  to  the  corner.  Stan  was  horny.  The  apparel  was 
sexy.  Stan  solicited  Alice's  sexual  favors  for  $75.  Alice  was  a  police-officer.  Alice  arrested 
Star,  for  soliciting  Alice's  sexual  favors  for  the  S75. 
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?b9  is  happy  that  ?a9  is  under  arrest  for  soliciting  *b9's  sexual  favors  for  *c3. 
7b9  is  a  police-officer. 

7a9  is  under  arrest  for  soliciting  ?b9's  sexual  favors  for  7c3. 

*b9  arrested  ?a9  for  soliciting  ?b9‘s  sexual  favors  for  7c3. 

*b9  believes  that  7a9  solicited  ?b9's  sexual  favors  for  7c3. 

?a9  solicited  ?b9's  sexual  favors  for  7c3. 

""39  is  horny 

7a9  believes  that  7b9  is  a  prostitute. 

*b9  is  rearing  ?c2. 

7b9  put  on  ?c2. 

*c2  is  an  apparel. 

*o9  is  a  person. 

7b9  has  *c2. 

*c2  is  sexy. 

*b9  is  at  '’14. 

*b9  -vent  from  *s2  to  *14. 

*14  is  a  location. 

*14  is  a  corner. 

*s2  is  a  location. 

*b9  is  a  physical  object. 

*b9  is  an  animate  object. 

*b9  is  a  character. 

*b9  'cas  at  *s2 
*b9  is  not  at  *s2. 

*a9  is  at  *14. 

*a9  ’lent  from  7s4  to  ?14. 

*s4  is  a  location. 

7a9  is  a  physical  object. 

*a9  is  an  animate  object. 

7a9  is  a  character. 

*a9  is  a  person. 

*a9  ’l  as  at  *s4. 

*a9  is  not  at  *s4. 

7c 3  is  money. 

*a9  has  a  communication  path  to  7b9 

It  is  illegal  for  7a9  to  solicit  *b9's  sexual  favors  for  7c3. 

*b9  is  not  equal  to  *a9. 


Figure  B.23:  Solicit  Example  —  Generalized  Explanation 
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DressSolicitArrest  b9  c2  s2  7s4  14  ?a9  c3)  b9  entrapped  a9 

(Constraint  (Not  (EQLAL  °b9  °a9 ) 3  (Isa  7c3  Money)  (Isa  ?a9  Person)  (Isa  "s4  Location) 

(Isa  7s2  Location)  (Isa  °14  Corner)  (Isa  °b9  Person)  (Isa  ?c2  Apparel)) 
(Precondition  (At  °a9  ?s4 )  (At  '>b9  ?s2)  (Seductive  ?c2)  (Possess  °b9  ?c2)  (NeedSex  ?a9)i 
(Motivation  (Occupation  7b9  police-officer)) 

(Effect  (Attire  7b9'7c2)  (Not  (At  7b9  °s2))  (At  7b9  ?14)  (Not  (At  7a9  ?s4))  (At  °a9  ?14) 
(Believe  ?b9  (Solicit  ?a9  ?b9  7c3))  (Arrested  7a9  (Solicit  ?a9  7b9  7c3)  ?b9)l 
(Subaction  (Dress  °b9  7c2)  (Ptrans  ?b9  7b9  ?s2  714)  (Ptrans  7a9  ?a9  ?s4  714) 

(Solicit  °a9  °b9  °c3)  (Arrest  7b9  7a9  (Solicit  ?a9  7b9  °c3))) 

(Internal  (Illegal  (Solicit  0a9  ?b9  7c3))  (CommPath  ?a9  ?b9) 

(Believe  °a9  (Occupation  °b9  prostitute))  (Isa  7a9  PhysicalObject) 

(Isa  7a9  Animate)  (Isa  ?a9  Character)  (Isa  ^9  PhysicalObject) 

(Isa  7b9  Animate)  (Isa  7b9  Character)  (Isa  714  Location)) 

(Links  (Antecedent  (Internal  2)  (Effect  5))  (Antecedent  (Internal  2)  (Effect  3)) 

(Antecedent  (Internal  3)  (Effect  l))  (Antecedent  (Internal  3)  (Precondition  3)) 
(Antecedent  (Internal  3)  (Effect  3))  (Antecedent  (Internal  3)  (Effect  5)) 

(Antecedent  (Internal  3)  (Constraint  6))  (Antecedent  (Internal  4)  (Internal  5)) 
(Antecedent  (Internal  5)  (Internal  6))  (Antecedent  (Internal  6)  (Constraint  3)) 
(Antecedent  (Internal  7)  (Internal  S))  (Antecedent  (Internal  S)  (Internal  9)) 
(Antecedent  (Internal  9)  (Constraint  7))  (Antecedent  (Internal  10)  (Constraint  6)) 
(Precondition  (Subaction  1)  (Precondition  4))  (Constraint  (Subaction  l)  (Constraint  &)) 
(Constraint  (Subaction  1)  ('Constraint  7))  (Effect  (Subaction  1)  (Effect  1)) 

(Precondition  (Subaction  2)  (Precondition  2))  (Constraint  (Subaction  2)  (Internal  10)) 
(Constraint  (Subaction  2)  (Constraint  5))  (Constraint  (Subaction  2)  (Internal  7)) 
(Constraint  (Subaction  2)  (Internal  5))  (Effect  (Subaction  2)  (Effect  2;) 

(Effect  (Subaction  2)  (Effect  3))  (Precondition  (Subaction  3)  (Precondition  1)) 
(Constraint  (Subaction  3)  (Internal  10))  (Constraint  (Subaction  3)  (Constraint  4)) 
(Constraint  (Subaction  3)  (Internal  4))  (Constraint  (Subaction  3)  (Internal  5)) 

(Effect  (Subaction  3)  (Effect  4))  (Effect  (Subaction  3)  (Effect  5)) 

(Precondition  (Subaction  4)  (Internal  2))  (Motivation  (Subaction  4)  (Precondition  5)) 
(Motivation  (Subaction  4)  (Internal  3))  (Constraint  (Subaction  4)  (Constraint  2)) 
(Constraint  (Subaction  4)  (Constraint  7))  (Constraint  (Subaction  4)  (Constraint  3)) 
(Effect  (Subaction  4)  (Effect  6))  (Precondition  (Subaction  5)  (Effect  3)) 

(Precondition  (Subaction  5)  (Effect  5))  (Motivation  (Subaction  5)  (Motivation  1); 
(Motivation  (Subaction  5)  (Effect  6))  (’Constraint  (Subaction  5)  (Internal  D) 
(Constraint  (Subaction  5)  (Constraint  D)  (Constraint  (Subaction  5)  (Internal  6)) 
(Constraint  (Subaction  5)  (Internal  9))  (Effect  (Subaction  5)  (Effect  7))) 

Figure  B.24:  DressSolicitArrest  Schema  Definition 
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because  btar.  was  under  arrest  lor  sonciiir.s  Alice  s  sexuai  favors  !cr  the  5/0. 


>  Vv'hv  was  Stan  under  arrest0 


Because  Alice  entranced  Stan. 


>  Who  arrested  Sta.n° 

Alice  arrested  Stan  for  soliciting  Alice's  sexual  favors  for  the  S75. 


>  Why  did  Alice  arrest  Stan? 

Because  Alice  was  a  police-officer  and  because  Alice  believed  that  Stan  solicited  Alice’s  sexua 
favors  for  the  S75 . 


>  Why  did  Stan  tell  Alice  if  she  had  sex  with  him  then  he  would  give  her  money? 
Because  Stan  believed  that  Alice  was  a  prostitute  and  because  Stan  was  horny. 


>  Why  did  Stan  believe  that  Aiice  was  a  prostitute0 

3ecause  Alice  entrapped  Stan  and  because  the  apparel  was  sexy. 

This  time.  Star,  s  action  of  soliciting  Alice  suggests  the  new  DressSoiicitArrest  schema.  This  schema 
;s  determined  when  its  effect  of  Stan  being  under  arrest  for  solicitation  matches  a  fact  addeG  as  a 
precondition  of  his  incarceration.  The  system's  explanation  for  why  Stan  is  under  arrest  is  shown 
in  Figure  8.15 

The  fact  that  the  test  narrative  mentions  that  Alice  is  a  police-officer  is  not  crucial  to  the 
understanding  of  this  narrative.  For  example,  the  following  narrative  is  also  understood  using  the 
learned  schema. 


Alice  was  at  a  corner  wearing  tight  blue  jeans.  Stan  told  Alice  if  she  had  sex  with  him  then  he 
•■c  ould  give  her  S 75  Star,  went  to  jail. 

If  after  processing  this  narrative  GENESIS  is  asked:  "Who  arrested  Star.0."  it  replies:  "Alice 
arrested  Star,  for  soliciting  Alice's  sexuai  favors  for  the  S75."  Because  the  schema  suggestion  ar.c 
Getermmat.cr.  rmcedures  are  based  primarily  on  confirming  that  certain  actions  have  occurred,  the 
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Stan  is  under  arrest  for  soliciting  Alice's  sexual  favors  for  the  S75. 
Alice  arrested  Stan  for  soliciting  Alice's  sexual  favors  for  the  $75. 
Alice  believes  that  Stan  solicited  Alice's  sexual  favors  for  the  575. 
Stan  solicited  Alice's  sexual  favors  for  the  S75. 

Stan  is  horny. 

Stan  believes  that  Alice  is  a  prostitute. 

The  corner  is  a  corner. 

Stan  is  at  the  corner. 

Stan  vent  to  the  corner. 

Stan  is  an  animate  object. 

Stan  is  a  character. 

Stan  is  a  person. 

Stan  is  a  physical  object. 

Stan  is  a  location. 

The  corner  is  a  location. 

Stan  was  at  some  place. 

Stan  is  not  at  some  place. 

Alice  is  at  the  corner. 

Alice  went  to  the  corner. 

Alice  is  an  animate  object. 

Alice  is  a  character. 

Alice  is  a  person. 

Alice  is  a  physical  object. 

Alice  is  a  location. 

Alice  was  at  some  place. 

Alice  is  not  at  some  place. 

The  apparel  is  sexy. 

Alice  is  wearing  the  apparel. 

Alice  put  on  the  apparel. 

The  apparel  is  an  apparel. 

Alice  has  the  apparel. 

The  $”5  is  money. 

Stan  has  a  communication  path  to  Alice. 

Alice  is  a  police-officer. 

Alice  is  not  equal  to  Stan. 

It  is  illegal  for  Stan  to  solicit  Alice's  sexual  favors  for  the  S~5. 


Figure  B.25:  Solicit  Test  Example  After  Learning  —  Specific  Explanation 


■sol  e::ect  these  processes.  O:  course  a  better  understanding  system  would  probably  consider  the 
number  of  a  schema's  preconditions  that  are  known  to  be  satisfied  and  use  this  miormauor.  to  help 
choose  between  competing  interpretations  of  the  text. 

GENESIS  can  also  use  its  conjectured  meaning  for  the  word  "entrap"  to  help  understand  nar¬ 
ratives  that  it  would  otherwise  find  incomprehensible.  For  example,  in  the  following  narrative,  the 
presence  of  the  word  "entrap"  is  the  only  thing  that  suggests  the  DressSolicitArrest  schema. 

Input:  Jan  entrapped  Mike  Mike  went  to  jail. 

Thematic  goal  achieved:  Jan  is  happy  that  Mike  is  under  arrest  for  soliciting  Jan’s  sexual 

favors  for  the  money. 

Ready  for  questions: 

>  Summarize 

Jan  had  an  apparel.  Jan  put  on  the  apparel.  Jan  was  at  some  place.  Jan  went  to  a  corner. 

Mike  was  at  some  place.  Mike  went  to  the  corner.  Mike  was  horny.  The  apparel  was  sexy. 

Mike  soiic.tec  Jan's  sexual  favors  for  money.  Jan  was  a  police-officer.  Jan  arrested  Mike  for 

soliciting  Jan's  sexual  favors  for  the  money. 

Of  course,  the  English  word  "entrap"  does  not  actually  refer  specifically  to  "solicitation- 
entrapment:"  however,  given  the  system's  current  experience  with  this  word,  its  interpretation  of 
this  narrative  :s  reasonable. 
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C.l.  Action  Schema  Definitions 

((Die  ’’a)  "7a  died." 

(Constraint  (Isa  ?a  Person)) 

(Effect  (Dead  ?a))) 

(!  Inherit  (’a  M  ?d)  "?a  inherited  (M's  °o.“ 

(Constraint  (Isa  ?a  Character)  (Isa  ?d  Person)  (Isa  7o  Inanimate)) 

(Precondition  (Heir  ?a  7d)  (Possess  ?d  ?o)  (Dead  ?d)) 

(Effect  (Possess  7a  ?o)  (Not  (Possess  ?d  ?o)))) 

((.Wm  7a  ?o  °c)  "?a  won  7o  in  ?c." 

(Constraint  (Isa  ?a  Character)  (Isa  ?o  Inanimate)  (Isa  7c  Game)) 

(  Effect  ( Possess  ?a  °o))) 

((Aim  ’’a  ?o  7s)  "?a  aimed  7o  at  ?s." 

(Constraint  (Isa  ?a  Person)  (Isa  ?o  PhysicalObject)  (Isa  7s  PhysicalObject)) 

(Precondition  (LosPath  7a  °s)  (Possess  ?a  ?o)) 

(Effect  (Pointir.gAt  ?a  °o  ?s))) 

((Arrest  '’a  7b  Cd  °b  .  ?r))  "?a  arrested  ?b  for  (?d  7b  .  ?r)." 

(Constraint  (Isa  7a  Character)  (Isa  7b  Character)  (Not  (EQUAL  7a  ?b))  (Illegal  (?d  ?b  .  7r))) 
(Precondition  (At  nb  '’!)  (At  7a  71)) 

( Motivation  (Believe  ?a  (7d  .  ?r))  (Occupation  ?a  police-officer)) 

>'  Effect  (Arrested  °b  (°d  °b  .  ?r)  7a))) 

(lAtrans  °a  7o  ?t  ?1)  “?a  gave  7t  ?o  at  ?1." 

(Constraint  (Isa  ?a  Character)  (Isa  7o  Inanimate)  (Isa  ?t  Character)  (Isa  ?1  Location)) 
(Precondition  (Possess  '’a  7o)) 

(Effect  (Possess  ?t  7o)  (Not  (Possess  7a  °o))) 

I  Suggest  (( Atrans  7a  ?o  ?t  '*!)  —  —  >  (Poison  ?a  ?t  ?o)))) 

((Indemnify  ?a  "*0  ?t  ('*s  ?i))  "?a  indemnified  ?t  ?o  for  the  loss  of  ?i." 

1  Constraint  (Isa  °a  Insurar.ceCo)  < Isa  ?o  Money)  (Isa  7i  Inanimate)) 

Motivation  <  Believe  "a  f "’s  ?i))  (Insured  (?s  ?i)  7t  ?o  ?a))J 

(•'  Bargain  °a  °b  (?p  7a  .  7r)  (°d  '’b  .  ?D) 

"7a  and  '’b  carried  out  a  bargain  in  which  (?p  7a  .  ?r)  and  (?d  ?b  .  ?1)." 

:  Constraint  (Isa  '’a  Character)  (Isa  ?b  Character)) 

Precondition  (‘Pointer*  (0p  °a  .  ?r)  Precondition)  (‘Pointer*  (7d  '’b  .  ?l)  Precondition) 

(GoalPriority  °b  (‘Pointer*  (?p  7a  .  ?r)  PosEffect)  (‘Pointer*  (?d  ?b  .  71)  NegEffect)) 
(Believe  7b  ('Pointer*  (7p  7a  .  ?r)  Precondition))) 

'  Motivation  (GoalPriority  °a  (‘Pointer*  (7d  7b  .  ?l)  PosEffect)  (‘Pointer*  (?p  7a  .  ?r)  NegEffect))) 
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■  Effect  (‘Pointer*  (°d  °b  .  71)  PosEfTect)  (.‘Pointer*  (7p  °a  .  7r)  PosEfTect) 

(Not  (‘Pointer*  CM  °b  .  ?1)  NegEffect))  '  \ot  (‘Pointer*  (7p  ?a  .  7r)  NegEffect)) 

(Not  (Beiieve  °b  (‘Pointer*  (7p  °a  ?r)  NegEffect))) 

(Not  (Beiieve  °a  (‘Pointer*  (?p  °a  .  ?r)  NegEffect)))) 

(Subaction  (Mtrans  °a  (Implies  (?d  7b  .  ?1)  (7p  °a  °r))  °b)  l?d  7b  .  °1 )  (?p  7a  .  7r)) 

(Internal  (Believe  ?b  (Implies  (?d  ?b  .  ?1)  (7p  °a  °r)))  (Believe  ?a  (Implies  (?p  °a  ?r)  (7d  7b  .  71)))) 
(Links  (Effect  (Subaction  2)  (Effect  1))  (Effect  (Subaction  3)  (Effect  2)) 

(Effect  (Subaction  2)  (Effect  3))  (Effect  (Subaction  3)  (Effect  4)) 

(Precondition  (Subaction  2)  (Precondition  2))  (Effect  (Subaction  1)  (Internal  1)) 
(Precondition  (Subaction  3)  (Precondition  1 ))  (Motivation  (Subaction  2)  (Internal  l)) 
(Motivation  (Subaction  2)  (Precondition  3.0  (Motivation  (Subaction  3)  (Motivation  l)) 
(Motivation  vSubaction  3)  (Internal  2)))) 


'.(Burn  (’a  ?o)  '"’a  burned  °o." 

(Constraint  (Isa  ?a  Person)  (Isa  ?o  Inanimate)) 
(Precondition  (Flammable  ?o)  (Not  (Burnt  °o))) 
(Effect  (Burnt  7o)  (Believe  (’a  (Burnt  ?o)))) 


((Capture  7a  ?b  ?c  7r)  “’’a  captured  ?b  and  locked  it  in  ?r." 

(Constraint  (Isa  ?a  Character)  (Isa  ?b  Person)  (Isa  ?c  Location)  (Isa  ?r  Room)  (In  ?c  ?r)) 
(Precondition  (Free  ?b)) 


(Effect  (Captive  ?b  7a  ?c  ?r)  (Not  (Free  ?b))  (Believe  ?a  (Captive  ?b  ?a  ?c  ?r))) 

(Subaction  (Ptrans  7a  ?b  2d  ?c)  (Ptrans  ?a  ?b  M  ?d)  (Coniine  7a  ?b  7c  ?r)) 

(Internal  (At  7b  ?d)  (At  ?b  ?c)) 

(Links  (Constraint  (Subaction  3)  (Constraint  4))  (Constraint  (Subaction  3)  (Constraint  5)) 

(Constraint  (Subaction  3)  (Constraint  l))  (Constraint  (Subaction  1)  (Constraint  l)) 

(Constraint  (Subaction  1)  (Constraint  3))  (Constraint  (Subaction  2)  (Constraint  l)) 

(Effect  'Subaction  2)  (Internal  1))  (Precondition  (Subaction  l)  (Internal  l)) 

(Effect  (Subaction  l)  (Internal  2))  (Precondition  (Subaction  3)  (Internal  2)) 
(Precondition  (Subaction  3)  (Precondition  l))  (Effect  (Subaction  3)  (Effect  1)) 
(Effect  ^Subaction  3)  (Effect  2))  (Effect  (Subaction  3)  (Effect  3)))) 


'.(Communicate  "a  ?o  7i)  "7a  contacted  7o  and  told  it  that  ?i." 
(Constraint  (Isa  7a  Character)  (Isa  ?o  Character)) 
Precondition  (Believe  ?a  ’’i)) 
i  Effect  (Beiieve  M  ?:)) 


'Subaction  (Mtrans 


(Links  (Precondition  (Subaction  1)  (Precondition  1))  (Effect  (Subaction  l)  (Effect  l)) 

(Constraint  (Subaction  1)  (Constraint  1))  (Constraint  (Subaction  1)  (Constraint  2)))) 


( I  Telephone  7a  ?c  M'  "?a  called  ?o  and  told  it  that  ?i." 

Precondition  (Know  °a  (HasPhone.Number  7o  M))) 

'  Subaction  DialTeiephone  7a  ?o  ?n)  ( AnswerTeiephone  ?o  ?a)  (Taik  "a  ?i  ?o ) ) 

'  Internal  (PhoneRinging  ?o  ?a)  (CommPath  '’a  ?o)) 

(Links  (Precondition  (Subaction  1)  (Precondition  1))  (Effect  (Subaction  1)  (Internal  l)) 
(Precondition  (Subaction  2)  (Internal  1))  (Effect  (Subaction  2)  (Internal  2)) 
(Precondition  (Subaction  3)  (Internal  2))) 

Abstractions  '(Subaction  3)  Communicate  (Subaction  1)))) 


'  (  Confine  M  M  °c  7r)  "?a  confined  7b  7c." 

'  Suggest  ((Confine  M  M  ?c  7r )  -----  >  (Capture  7b  7c  ?r)))) 
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( (DialTelephone  '’a  °o  ’’n)  "°a  called  ')o." 

(Suggest  ((  DialTelephone  ’a  7o  ’n)  —  >  (Telephone  7  a  ''o  ?i)))))) 

((Dress  '’a  °c)  "?a  put  on  ?c." 

I  Constraint  (Isa  ’’a  Person)  (Isa  ?c  Apparel)) 

(Precondition  (Possess  ?a  ?c)) 

(Effect  (Attire  "’a  ?c))) 

((Incarcerate  ’’a  7o  7i  ?d)  "'’o  was  put  in  7j." 

(Constraint  (Isa  7o  Person)  (Isa  7j  Jail)  (Isa  ?d  Location)  (At  ?j  7c)) 

(Motivation  (Arrested  ?o  ?v  ?p)) 

(Effect  (At  7o  ^d)) 

•  (Subaction  (Ptrans  ?o  ?o  ?s  ?d)) 

(Links  (Constraint  (Subaction  l)  (Constraint  l))  (Constraint  (Subaction  l)  (Constraint  2)) 

(Constraint  (Subaction  1)  (Constraint  3))  (Motivation  (Subaction  l)  (Motivation  l)) 

(Effect  (Subaction  1)  (Effect  1)))) 

((InsureObject  ?a  (°s  ?o)  ?v  ?c)  "7a  insured  ?o  with  7c  for  %  in  case  (?s  7o)." 

(Constraint  (Isa  ?a  Character)  (Isa  ?o  Inanimate)  (Isa  °v  Money)  (Isa  ?c  Insura.nceCo)) 

(Precondition  (Possess  ?a  ?o)  (Not  (?s  7o))) 

(Effect  (Insured  (?s  '’o)  ?a  ?v  7c))) 

((Mtrans  '’a  ?i  ?o)  "'’a  told  ?o  that  7i." 

(Suggest  ((Mtrans  ?a  (Implies  (Coitus  7b  7a)  (Atrans  '’a  ?o  7b  '’l))  ?b)  — -  >  (Solicit  ?a  °b  ?o)) 

((Mtrans  7a  (Implies  (?p  ?b  .  ?r)  (?d  ?a  .  ?1))  '’b) - >  (Bargain  ’’a  ?b  (7d  7a  .  71)  (?p  7b  .  ?r))) 

((Mtrans  ?a  (Implies  (Not  (7p  7b  .  ?r))  (?d  ?a  .  °1))  7b)  - — > 

(Threaten  7a  7b  (?d  7a  .  71)  (?p  °b  .  7r ))))) 

((Murder  7a  ?v)  "7a  murdered  ?v." 

(Constraint  (Isa  °a  Character)  (Isa  ?v  Person)) 

(Effect  (Dead  ?v)) 

(Subaction  (GenesisAcuon  7a)  (Die  7v)) 

(Internal  (State  7v)  (State  ’’o)  (BadHealth  7v)) 

(Links  (Effect  (Subaction  l)  (Internal  D)  (Antecedent  (Internal  3)  (Internal  l)) 

(Antecedent  (Internal  3)  (Internal  2))  (Precondition  (Subaction  2)  (Internal  3)) 

(Effect  (Subaction  2)  (Effect  l)))) 

((Poison  7a  ?v  7p)  “'’a  poisoned  ?v  with  ?p." 

(Constraint  (Poisonous  7p)  (Isa  ?p  Food)) 

(Precondition  (Possess  7a  ?p)  (NeedSustenance  7v)) 

(Effect  (Inside  7p  7v)  (Not  (Possess  7a  ?p))  (Not  (Possess  ?v  ’’pj)) 

(Subacticn  (Atrans  '’a  ?p  ?v  71)  (Ingest  ?v  ?p)  (Die  ?v)) 

(Internal  (Possess  °v  7p)  (BadHealth  ?v)) 

(Links  (Precondition  (Subaction  I )  (Precondition  I ))  (Effect  (Subaction  1)  (Interna!  1)) 

(Effect  (Subaction  1)  (Effect  2))  (Constraint  (Subaction  2)  (Constraint  2)) 

'’Precondition  (Subaction  2)  (Internal  l))  (Motivation  (Subaction  2)  (Precondition  2)) 

(Effect  (Subaction  2)  (Effect  l))  (Effect  (Subaction  2)  (Effect  3)) 

(Antecedent  (Internal  2)  (Constraint  1))  (Antecedent  (Internal  2)  (Effect  l)) 

(Precondition  (Subaction  3)  (Internal  2))) 

(Abstractions  ('"Internal  2;  Murder  (Internal  3))  ((Internal  1 )  Murder  f  Internal  I )) 

((Effect  1)  Murder  (Internal  2))  ((Subaction  3;  Murder  i Subaction  2)) 

('Subaction  l)  Murder  (Subaction  l)))) 
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UShootToKill  0a  ?v  ?g)  "°a  shot  °v  and  killed  it." 

.Constraint  (Isa  "’g  Gun)) 

^Precondition  (Possess  ?a  7g)) 

1  Effect  (Shot  ?v)) 

(Subaction  (Shoot  ?a  ?v  ?g)  (Die  7v)) 

>  Internal  (  BadHealth  ?v)) 

1  Links  (Precondition  (Subaction  1)  (Precondition  l))  (Effect  (Subaction  l)  (Effect  1)) 
(Antecedent  (Internal  l)  (Effect  l))  (Precondition  (Subaction  2)  (Internal  1))) 

'  Abstractions  ((Effect  1)  Murder  (Internal  l))  ((Effect  1)  Murder  (Internal  2)) 

((Internal  l)  Murder  (Internal  3))  ((Subaction  2)  Murder  (Subaction  2)) 
((Subaction  l)  Murder  (Suoaction  1)))) 

uPtrans  ’’a  7o  7s  7d)  “?a  moved  7o  from  7s  to  ?d." 

(Constraint  (Isa  ?a  Animate)  (Isa  ?o  PhvsicalObject)  (Isa  ?s  Location)  (Isa  7d  Location)) 
(Precondition  (At  ?o  ?s)) 
i  Effect  (At  7o  ?d)  (Not  (At  7o  7s))) 

(Suggest  (((Ptrans  ?a  7o  ?s  ?d)  (Isa  ?j  Jail)  (At  7j  7d))  —  •—>  (Incarcerate  ?p  ?o  ?j  7d)))) 

((Drive  7a  7o  7s  ?d  ?c)  "7a  drove  7o  from  7s  to  ?d  in  ?c.“ 

(Constraint  (Isa  7c  Vehicle)) 

(Precondition  (Possess  7a  ?c)  (At  '’a  7i)  (At  7o  ?i)  (In  ?i  ?c)  (At  ?c  7s)) 

.Effect  (At  7a  I’d)  (At  7c  ?d)  (Not  (At  ?a  ?i))  (Not  (At  ?o  ?i))  (Not  (At  ?c  ?s)))) 

L  Release  °a  7b  7c  ?r)  “'’a  released  ?b." 

..Precondition  (Captive  ?b  ?a  ?c  ?r)) 

■  Effect  l  Free  7b)  (Not  (Captive  ?b  7a  7c  ?r)))) 

i  (Shoot  7a  ?v  7g)  "7a  shot  ?v  with  7g." 

Constraint  (Isa  ?g  Gun)  (Isa  (’a  Person)) 

Precondition  (Possess  ?a  ?g)(PointingAt  7a  7g  ?v)) 

'  Effect  (Shot  ?v)) 

i Suggest  ((Shoot  ?a  ?v  ?g)  >  (ShootToKill  ?a  ?v  ?g)))) 

((Solicit  '’a  '*b  7o)  “°a  solicited  7b's  sexual  favors  for  ?o." 

Constraint  (Isa  (’a  Person)  (Isa  7b  Person)  (Isa  ?o  Money)) 

1  Precondition  (CommPath  ?a  °b)) 

■  Motivation  (Believe  7a  (Occupation  ?b  prostitute))  (NeedSex  ?a)) 

'  Effect  (Believe  7b  (Solicit  7a  °b  7o))) 

Subaction  LMtrans  ?a  (Implies  (Coitus  7b  ?a)  (Atrans  '’a  7o  ?b  ?D)  '’b)) 

Links  (Motivation  (Subaction  1)  (Motivation  1))  (Motivation  (Subaction  1)  (Motivation  2)) 
(Precondition  (Subaction  1 )( Precondition  1 ))  (Effect  (Subaction  1)  (Effect  l)))) 

('Threaten  na  0b  (?p  ?a  .  ?r)  (?d  ?b  ’’!))  "?a  threatened  (?p  ?a  •  ?r)  unless  (?d  2b  .  71).“ 
(Constraint  (Isa  '’a  Character)  (Isa  7b  Character)) 

Precondition  (‘Pointer*  (7p  ’’a  .  7r)  Precondition)  ('Pointer*  (7d  7b  .  '*1)  Precondition) 

(Gcal  7b  (Not  (‘Pointer*  (7p  7a  .  ?r)  Effect)))) 

Effect  (‘Pointer*  l7d  ?b  .  71)  Effect)  (Not  (’Pointer*  (7d  7b  .  ?1)  NegEffect))) 

'  Subaction  :  Mtrans  7a  (Impiies  (Not  (?d  7b  .  7D)  (?p  7a  .  ?r))  ?b)  (7d  '’b  .  71)) 

'  Internal  ( Beiteve  7b  (Implies  (Not  (7d  7b  .  71))  (?p  7a  .  ?r)))) 

Links  'Precondition  (Subaction  2)  (Precondition  2))  (Motivation  (Subaction2)  (Precondition  3’) 
Motivation  (Subaction  2)  (  Internal  l))  (Effect  (Subaction  1)  (Internal  l)) 

■  Effect  :  Subaction  2)  (Effect  1 ))( Effect  (Subact ion  2)  (Effect  2D)) 
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C.2.  Inference  Rules 

((Seductive  ?x)  "°x  was  sexy." 

v Brules  ((Seductive  '’x)  <==  (Isa  7x  Skirt)  (Length  ?x  short)) 

((Seductive  ?x)  <—  (Isa  ?x  Jeans)  (Fit  ?x  tight)))) 

((CommPath  ?a  ?b)  "?a  had  a  communication  path  to  7b." 

(Brules  ((CommPath  ?a  ?b)  <==  (At  ?b  ?l)  (At  ?a  ?!)))) 

((Heir  ’’x  '*%■)  ”'>x  was  ?y’s  heir." 

(3rules  ((Heir  '’x  ?v)  <“  (Spouse  ?x  ?v)) 

((Heir  Tx  ?y)  <—  (Parent  7v  ?x)))) 

((Illegal  ?a)  “It  was  illegal  for  ?a." 

(Brules  ((Illegal  (Solicit  ?a  ?b  ?c))  <»«))) 

((LosPath  7a  ?b)  "'’a  had  a  line  of  sight  path  to  ?b." 

(Brules  ((LosPath  '’a  ?b)  <-»  (At  ?a  ?I)  (At  °b  ?l)))) 

((Believe  °x  7p)  "?x  believed  that  ?p.“ 

(Brules  ((Believe  ?x  (Occupation  7y  prostitute))  <-=  (Isa  ?1  Corner)! At  ?x  ?l)  (At  ?y  71) 
(Seductive  7c)  (Attire  ?y  ?c)))) 

((GoalPriority  ?a  I’d  ?f)  "'’a  wanted  7d  more  than  it  wanted  ?f.“ 

(Brules  ((GoalPriority  ?a  (Free  ?b)  (Possess  7a  ?o))  <-«  (PositivelPT  ?a  7b)))) 

((Spouse  '’a  9b)  "?a  was  7b's  spouse." 

(Brules  ((Spouse  ?a  ?b)  <—  (Spouse  ?b  ?a)) 

((Spouse  ?a  ?b)  <»=  (Married  ?a  ?b)))) 

((Possess  '’a  ?x)  "?a  had  ?x." 

(Brules  ((Possess  ?a  '’x)  <—  (Amount  ?y  ?g  ’’u)  (Possess  '’a  ?y)  (Amount  ?x  ?!  °u) 

(Isa  ?x  ?t)  (Isa  ?y  ?t)  (LessThan  ?1  7g)) 

((Possess  ?a  ?x)  <-»  (Holding  ?a  ?x)))) 

((ThemeGoalMet  °x  ?g)  "?x  was  happy  that  7g." 

(Brules  ((ThemeGoalMet  ?x  ( Possess  ?x  ?y))  <--  (Possess  ?x  7y)  (Isa  ?y  Valuable)) 
((ThemeGoalMet  ?a  (Insured  ?s  '’a  ?v  7c))  <-«  (Insured  '’s  ?a  ?v  ’c)) 
((ThemeGoalMet  ?a  (Arrested  nb  °d  '’a))  <»»  (Arrested  '’b  7d  ?a) 

(Occupation  7a  police-officer)))) 
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APPENDIX  D 


DETAILED  GENESIS  TRACE  FOR  THE  MURDER  EXAMPLE 


10_(  PrccessStorv  MURDER-T) 

Input:  (Martha  is  Gene's  mother  and  is  a  millionaire) 

Input:  (Isa  Personl  Person) 

Add  state:  Isal 

Input:  (Gender  Personl  female' 

Add  state:  Gender  1 

Input:  (Name  Personl  Martha) 

Add  state:  Namel 

Input:  (Isa  Person2  Person) 

Add  state:  Isa2 

Input:  (Gender  Person2  male) 

Add  state:  Gender2 

Input:  (Name  Person2  Gene) 

Add  state:  Name2 

Input:  (Isa  Money l  Money) 

Add  state:  Isa3 

Input.  (Amount  Mcneyl  order-millions  dollar) 

Add  state:  A  mount  1 

Input:  (Mother  Personl  ?erson2) 

Add  state:  Mother  1 

Inferring  Parent!:  'Parent  Personl  Person2) 

Inferring  PositiveiPTl:  (PosmvelPT  Personl  Per son2) 

Input:  (Possess  Personl  Money!) 

Add  state-  Possessl 

Input:  1  He  shot  her  and  she  died) 

Input:  'Shoot  Person2  Personl  ’'IN5TR6) 

Suggesting  ShootToKiill :(ShootToK.ill  Person2  Personl  "'1NSTR6 ) 

Suggesting:  Murderl :( Murder  Person2  Personl) 

Found  suggested  Shoot2:<  Shoot  Person2  Personl  "’INSTRoi 
matches  Shoot  1  1  Shoot  ?erson2  Personl  "'INSTR6) 

Assuming  suggested:  Possess2:( Possess  Person2  ’’INSTP.6 ) 
Assuming  suggested:  Shot2:>  Shot  Personl  ) 

Assuming  suggested  BadHealth  1 .(  BadHeallh  Person;) 

-.ssumtnt’  suggested:  BadHealth2  (BadHeaith  Person!  > 

Assuming  suggested.  Stated.!  State  'ol  ) 

Assuming  suggested  Statel  'State  Person!  ) 
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Y.O  Wl  W. 


I 


* 


K 

L 


-•.ssumnii’  suggested  GenesisActio.nl  (GenesisAction  Person2) 

Input.  i  Die  Person  1  i 

Found  suggested  Die2  (  Die  Persor.l) 
matches  Die3:(  Die  Person!) 

Assuming  suggested  Dead !:( Dead  Person  1) 

Assuming  suggested  DieldDie  Personl) 

Determining:  ShootToKilll  (ShootToKill  Person2  Personl  ’IN'SCRc) 

Looking  for  Precondition:!  Possess  Person2  ?!NSTR6) 

Unable  to  infer  (Possess  Person2  '’INSTR6! 

Precondition  not  found. 

Assuming  Possess3  (Possess  Person2  "’INSTR6) 

Looking  for  Constraint:! Isa  ?INSTR6  Gun) 

Constraint  not  found. 

Assuming  IsaU  (Isa  TMSTR6  Gun) 

Ejects: 

Shoi2  (Shot  Personl) 

Murder2.  (Murder  Person2  Person  1  linferrred  from: 

ShootToKilll:  (ShootToKill  Person2  Personl  "’INSTRo) 

Determining:  Murderl:  (Murder  Person2  Personl) 

Looking  for  Constraint:' Isa  Personl  Person) 

Found  Isa  1 :( Isa  Personl  Person) 

Looking  for  Constraint:!  Isa  Person2  Character) 
isa8:  (Isa  ?erson2  Character linferrred  from: 

Isa2  (Isa  Person2  Person) 

Ejects: 

Deadl:  'Dead  Personl) 

C-enesisAction2.  (Ger.esisAction  Person2  '\9  linferrred  :  rom 
Shoot2:  (Shoot  Personl  Person!  'TNSTR6) 

State3:  (State  Personl  linferrred  from: 

Shot2:  (Shot  Personl) 

Stated:  (State  Personl  linferrred  from 
Shoi2:  (Shot  Person!) 

input:  (Gene  got  SIOOOCOO) 

input:  (Isa  Monev2  Money; 

Add  state;  isalO 

input.  (Amount  Money!  1000000  dollar) 

Add  state:  Amount2 

input:  'Possess  Person2  Money2  ) 

Add  state  Possess-t 
Unable  to  infer  (Isa  Money 2  Gun) 

Ready  for  questions: 
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Input  i  Claud. us  evas  Agrippina  s  husband  and  owned  an  estate1 


Input  'Isa  Person!  Person) 
Add  slate  Isal 


Input:  'Gender  Personl  maie) 


Add  state:  Genderl 


Input.  (Name  Personl  Claudius) 
Add  state:  Namel 


Input:  'Isa  Pe:scn2  Person) 
Add  state-  Isa2 


Input:  :  Gender  Personl  female) 
Add  state:  Gender2 


Input.  (Name  Person2  Agrippina) 
Add  state  Name2 


Input.  (Isa  Estatel  Estate) 
Add  state  Isa3 


Input-  (Husband  Personl  Person2) 

Add  state:  Husbanal 

Inferring  Spousel  (Spouse  Personl  Person2) 

Inferrtnc  PositivelPTl  (PositivelPT  Personl  Person2> 


Input.  ■  Possess  Personl  Estate!  ■ 
Add  state  Possessl 


Input-  '  Agrippina  gave  him  a  poisonous  mushroom  and  he  died) 


Input,  t  Poisonous  Mushroom!  » 
Aid  state:  Poisonous  1 


Input  Isa  Mushrocml  Mushroom 
Add  state  Isa- 


Input  A'.rans  ?ersor.2  Mushrooml  Personl  "’AT6) 


Pound  Poisonousl.  (  Poisonous  Mushroom  I ) 
Isaf  'Isa  Mushrooml  Food )int errred  from 
Isa-i  (Isa  Mushrooml  Mushroom1 


Succestir.t’  Poison  1 .( Poison  Person.2  Personl  Mushrooml) 


Sucffstinif  Murder!  (  Murder  ?erson2  Personl) 


round  suggested  Atrans2:(  Atrar.s  P?rson2  Mushrooml  Person!  ~".l 
matches  A trar.sl ,( Atrans  Person2  Mushrooml  Person!  'AT6  ’ 
•ssummc  su?gested:  PossessS  'Possess  Person2  Mushroom)  ) 
Pssumint;  succested:  Not2  Not  (Possess  Person2  Mushroomi)) 
•ssummc  suggested  Possess-^  1  Possess  Personl  Mushrooml  . 
v.'UT.nc  suggested:  State!  State  Person]  ) 
viVir.ir.i'  suggested.  Genesis  Action  1  Genesis.Acticn  Person!) 


Input  Die  ? e r - o n  1 
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Found  suggested  Die2:tDie  Personl) 
matches  DieodDie  Personl) 

Assuming  suggested-  3adHealth2:(  BadHealth  Personl) 

Assuming  suggested.  State2:(State  ",o2) 

Assuming  suggested  Insidel .( Inside  Mushrooml  Personl) 

Assuming  suggested:  Ingest  1 :( Ingest  Personl  Mushrooml) 

Assuming  suggested:  N'eedSustenancel:(NeedSustenance  Personl  ) 
Assuming  suggested:  lsa7:(Isa  Mushrooml  Food) 

Assuming  suggested.  Not3:(Not  (Possess  Personl  Mushrooml)) 
Assuming  suggested:  BadHea!ihl:(BadHealth  Personl) 

Assuming  suggested.  Poisonous2:(Poisonous  Mushrooml) 

Assuming  suggested.  Dead T( Dead  Personl) 

Assuming  suggested:  DieldDie  Personl) 

Determining:  Poisonl:  (Poison  Person2  Personl  Mushrooml) 

Looking  for  Constraint:* Isa  Mushrooml  Food) 

Found  Isa5:(Isa  Mushrooml  Food) 

Looking  for  Constraint:* Poisonous  Mushrooml) 

Found  Potsonousl:( Poisonous  Mushrooml) 

Looking  for  Precondition:! NeedSustenance  Personl) 

Unable  to  infer  (NeedSustenance  Personl ) 

Precondition  not  found, 

Assuming  NeedSustenance2:  (NeedSustenance  Personl) 

Looking  for  Precondition:* Possess  Person2  Mushrooml) 

Unable  to  infer  (Possess  Person2  Mushrooml) 

Precondition  not  found. 

Assuming  Possess5  (Possess  Person2  Mushrooml) 

Effects: 

Insidel:  (inside  Mushrooml  Personl) 

N'ot2:  (Not  (Possess  Person2  Mushrooml)) 

Not2  deletes  Possess*:  (Possess  Person2  Mushrooml) 

N'ot3:  (Not  (Possess  Personl  Mushrooml)) 

Not3  deletes  Possess-!:  (Possess  Personl  Mushrooml  ) 

Murder2:  'Murder  Person2  Personl  linferrred  from: 

Potsonl:  (Poison  person.  Personl  Mushrooml) 

Determining:  Murderl:  (Murder  Person2  Personl) 

Looking  for  Constraint:!  Isa  Person!  Person) 

Found  Isaldlsa  Personl  Person) 

Looking  for  Constraint:!  Isa  Person2  Character) 

Isa'.C:  (Isa  Person2  CharacterJinferrred  from: 

Isa2:  (Isa  Person2  Person) 

Ejects. 

Deadl:  (Dead  Personl) 

Genesis Action2:  (GenesisAction  Person2  .  ?x  13 linferrred  from: 

Atrans2:  (Atrans  Person2  Mushrooml  Personl  ?12) 

S:ate3:  (State  Personl  linferrred  from: 

PossessJ:  (possess  Personl  Mushrooml) 

State-l.  ( State  Mushrooml  linferrred  from: 
insidel.  ( inside  Mushrooml  Personl) 


(  She  inherited  the  estate) 
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..:  inherit  Perso r.2  Estate:  ’FROM:1 
.'reate  action.  Inherit  1 
riur.d  Isul  'Isa  Persor.l  Person! 

I_ooic:ng  for  Precondition  ( Dead  TROMo; 
round  Z)eadl:(Dead  Persor.l. > 

Looking  for  Precondition.!  Possess  Person  1  Estatel) 
Found  Possess  1:  Possess  PersonI  Estatel) 


V 


& 


Look  ms  tor  Constraint  (Isa  Person!  Person) 

Found  Isa I. (Isa  PersonI  Person) 

Loo.sine  tor  Constraint. i  Isa  Person2  Character) 

Found  IsaiO.tlsa  Person!  Character) 

Looking  for  Precondition:! Heir  Person!  PersonI) 
Heir!.  (Heir  Person!  PersonI  Hnferrred  from: 

Spoused:  (Spouse  Person!  PersonI  Jinferrred  from: 
Spousel-  (Spouse  PersonI  Person!) 

Inferring  PositivelPT!:  (PositivelPT  Person!  PersonI) 


r: 


Looking  for  Constraint:(Isa  Estatel  Inanimate) 
Isa  1 5 :  (Isa  Estatel  Inammaie)inferrred  from: 
Isal"  (Isa  Estatel  Valuable)infe:rred  from: 
Isa3:  (Isa  Estatel  Estate) 


1 


Effects: 

Possess?:  (Possess  Person!  Estatel ) 

N'oto:  (Not  (Possess  PersonI  Estatel)) 

Note  deletes  Possess!:  (Possess  PersonI  Estatel) 

ThemeGoalMetl:  (ThemeGoalMei  Person!  (Possess  Person!  Estate!))  inferrred  from: 
Isa!"-  Isa  Estatel  Valuable) 

Possess?:  (Possess  Person!  Estatel  > 


eg 


Thematic  goal  achieved:  ThemeGoal.Met  1 ;( ThemeGoalMei  Person!  (Possess  Person!  Estate!)) 
Explanation  suitable  for  -’eneralization. 

Pruning.. 

Prunine  unit-Isal”  ((Isa  Estatel  Valuable) 

<  == 

( Isa  Estatel  Estate)) 

Pruning  umt  lsaiO  ((Isa  Person!  Character) 

'Isa  Person!  Person)) 

Prunm:  unit  Isa;  i  (Isa  Mushrooml  Food) 

<  *« 

(Isa  Mushrooml  Mushroom)) 

Prunm;  unit. Murder!  ((Murder  Person!  PersonI  ) 

<  =« 

(Poison  Person!  PersonI  Mushrooml)) 

Prunm:  mit  Spousel  (( Spouse  Personi  Person!) 

< 

(Husband  Personi  Person!)) 

Prunm:  unit  Spoused  '(Spouse  Person!  Personi) 

<  =  = 

i  Spouse  Personi  Person!') 

Prunm:  mil  Heir!  'Heir  Person!  Personi  > 


< 

I 


) 


I 

■ 


s 


a 


■  Spouse  Person2  Person i  ') 


Generalitting... 

Packaging. 

In  general  Not6  deletes  Possess?  when  T 
Creating  Neu  Schema:  (Murderlnherit  ?a5  ’’o4  Ml) 

Schema  summaryhMurderlnherit  '’aS  "’o4  'dii 
~a5  is  a  character. 

"’d  1  is  a  person. 

^aq  murders  ~’d  1 . 

~o4  :s  a  valuable. 

"'dl  has  ~c4 
’’a 5  is  ?d  1  's  heir. 

?a5  inherits  ?dl's  ?o4. 

Having:  (Murder  ?a5  ?dl)  suggest  Murderlnherit 
Ready  for  questions: 

> 

1 2_>  ProcessStory  ML'RDER-T) 

Input:  (Martha  is  Gene's  mother  and  is  a  millionaire! 

Input:  (Isa  Personl  Person) 

Add  state:  Isal 

Input:  (C-ender  Personl  female) 

Add  state:  Gender  1 

Input:  (Name  Personl  Martha) 

Add  state  N'amei 

Input:  (Isa  Persor.2  Person) 

Add  state:  Isa2 

Input:  ‘Gender  Person2  male) 

Add  state:  Gender2 

input:  (  Name  ?erson2  Genei 
Add  state  Name2 

Input:  !  Isa  Money  1  Money) 

Add  state  Isa3 

Input.  (Amount  Money!  order-millions  dollar ; 

Add  state  A  mount  1 

Input.  (Mother  Personl  ?erson2) 

Add  state  Mother! 

Inferring  Parentl  (Parent  Person!  ?erson2) 

Inferring  PositivelPTl.  (PosttivelPT  Personl  Person2) 

input  Possess  Person!  Monevl) 

-.do  state  Possess! 

Input  He  snct  her  and  she  died) 


»*•  ♦ 


I 


i  ir.oo:  ;  erscnd  rersor,:  ..no.  r.o 


Suc-destine-  ShoctToKillI.  (ShootToKil!  Person!  Person!  "'IN'CRo 

Suggesting  Murder  1 :!  Murder  Person!  Person  1  ! 

Found  suggested  Shoot! :( Shoot  Person!  Personl  dN'S7R6) 
matches  Shoot!  .(Shoot  Person!  Person!  *'IN'STR6) 

Assuming  su^jested.  Possess!:!  Possess  Person!  "’IN'STRS) 

Assu mins  suggested  Shot!  !S!tot  Person!) 

Assuming  suggested.  3adHeaithI  <  SadKealth  Personl) 

Assuming  suggested  BadHeaith!1  3adHealth  Personl  1 
Assuming  suggested.  State!:!  State  Ml  ) 

Assuming  suggested  State! .( State  Personl  i 

Assuming  suggested  Genesis  Action!.  (GenesisAction  Person!) 

Input.  (  Die  Personl ) 

Found  suggested  Die!  ( Die  Personl! 
matches  Die!  <  Die  Personl) 

Assuming  suggested:  Dead1.:! Dead  Person!) 

Assuming  suggested  DielhDie  Personl  ) 

Determining-  ShootTcXill  1  ' ShootToXill  Person!  Personl  7INSTR6) 

Looking  for  Precondition:!  Possess  Person!  '’INS7R6) 

L'r.abie  to  infer  (Possess  Person!  'INSTR6) 

Precondition  not  found. 

Assuming  Possess!:  ''Possess  Person!  7INSTR6) 

Looking  for  Constraint:!  Isa  DNS7R6  Gun) 

Constraint  not  found. 

Assuming  Isa 7 :  (Isa  '’IN’STRh  Gun) 

Ejects: 

Shot!:  1  Shot  Personl  ) 

Murder!:  (Murder  Person!  Personl  hnferrred  from: 

Shoot7oKillI:  !  ShootCoKiIi  Person!  Personl  '’IMS7R6) 

Determining-  Murder!:  (Murder  Person!  Personl) 

Isa?:  'Isa  Person!  Character linferrred  from: 

Isa!  lisa  Person!  Perscm 

Found  Isa  1  :<  Isa  Person!  Person: 

Suggesting:  Murderlnherit  1 :(  Murderlnhern  Person!  ?oJ-2  Personl) 

Found  su; tested  Murder3:( Murder  Person!  Personl) 
match.es  Murder!:( Murder  Person!  Personl  > 

Assuming  suggested  Isa  10!  Isa  Person!  Character) 

-.ssuminf  sucrested:  Isall.dsa  Personl  Person) 

-.ssumme  suggested.  Dead!:!  Dead  Personl  ) 

GenesisAction!;  ( GenesisAction  Person!  .  "AS  linferrred  from: 

Shoot!:  (Shoot  Person!  Personl  ‘’1NS7R6) 

State!  'State  Personl  linferrred  from: 

Shot!.  'Shot  Person!  i 
Stated  'State  Personl  hnferrred  from 
Shot!  Shot  Personl  . 

Incut  Gene 2V\  $ 1 GOOOCO • 
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Input  '  Isa  Money  2  More-  ) 

Aua  state  Isa :  - 

Input-  l  Amour.:  Money!  IvOlOC-O  dollar ) 

Aad  stare  Amount! 

Input.  (Possess  ?erson2  Money’! 

Add  state  Possesso 
Unable  to  infer  (!sa  Money!  Gun) 

'.sal5  (Isa  Money!  Valuablelinferrred  from: 

Isa’.!:  (Isa  Money!  Money) 

Found  suggested  Possessfh  Possess  Person!  "o4-!) 
matches  Possesso :(  Possess  Person!  Monev!) 

Assuming  suggested:  Inher:tl:(Inherit  Person!  Money!  Persor.l! 
Assuming  suggested:  HeirMHeir  Person!  Personl) 

Assuming  suggested:  Possess4:( Possess  Personl  Money!) 

Assuming  suggested:  Isal3:llsa  Money!  Inanimate) 

Assuming  suggested:  Isal!:(Isa  Money!  Valuable) 

.Assuming  suggested:  NotldNot  (Possess  Personl  Money!)) 
Determining:  Murderlnheritl:  (Murderlnherit  Person!  Money!  Personl: 

Looking  for  Constraint  !  Isa  Money!  Valuable) 

Found  IsalS:(Isa  Monev!  Valuable) 


Looking  for  Constraint:!  Isa  Personl  Person) 
Found  IsaUUsa  Personl  Person) 


Looking  for  Constraint:!  Isa  Person!  Character) 

Found  IsaShlsa  Person!  Character) 

Looking  for  Precondition:*  Possess  Personl  Money!) 

Add  state:  LessThanl 

Pcssess8:  (Possess  Personl  Money! )inferrred  from: 

LessThanl:  tLessThan  1000000  order-millions) 

Isa3.  ( isa  Money  1  Money) 

Isa  14:  (Isa  Money!  Money) 

Amount!:  (Amount  Money!  1000000  dollar) 

Possess  1:  (Possess  Personl  Money  1) 

Amount’:  (Amount  Moneyl  order-millions  dollar) 

ThemeGoal.Metl;  (ThemeGoalMet  Personl  (Possess  Personi  Money!))  inferrred  from: 
Isalf-  (isa  Money!  Valuable) 

Possess8:  (Possess  Personl  Money!) 

Looking  for  Precondition:!  Heir  Person!  Personl) 

Heir!  'Heir  Person!  Personl  Jinferrred  from: 

Parent  1:  (Parent  Persor.l  Person!) 

Ejects: 

Deadl:  (Dead  Personl) 

Notl:  (No:  (Possess  Personl  Money!)) 

Noil  deletes  PossessS:  (possess  Personl  Money!) 

PcssessS:  (Possess  Personl  Money!) 

7h.emeGoai.Met2:  (ThemeGoalMet  Personl  (Possess  Personl  Merer!))  inferrred  from: 
Isalf.  (Isa  Monev!  Vaiuabie) 

Possess^:  (possess  Person!  Money!) 

hematic  ;Gal  achieved:  ThemeGcaiMe:2.(ThemeOoai.Met  Person!  (Possess  Person!  Monev!)) 
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